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Description 

[0001] This invention relates to newly identified polynucleotides, polypeptides encoded by such polynucleotides, the 
use of such polynucleotides and polypeptides, as well as the production and isolation of such polynucleotides and 
5 polypeptides. More particularly, the polynucleotides and polypeptides of the present invention have been putatively 
identified as esterases. Esterases are enzymes that catalyze the hydrolysis of ester groups to organic acids and alco- 
hols. 

[0002] Many esterases are known and have been discovered in a broad variety of organisms, including bacteria, 
yeast and higher animals and plants. A principal example of esterases are the lipases, which are used in the hydrolysis 

io of lipids, acidolysis(replacement of an esterified fatty acid with a free fatty acid) reactions, transesterif ication(exchange 
of fatty acids between triglycerides)reactions, and in ester synthesis. The major industrial applications for lipases in- 
clude: the detergent industry, where they are employed to decompose fatty materials in laundry stains into easily 
removable hydrophilic substances: the food and beverage industry where they are used in the manufacture of cheese, 
the ripening and flavoring of cheese, as antistaling agents for bakery products, and in the production of margarine and 

15 other spreads with natural butter flavors; in waste systems; and in the pharmaceutical industry where they are used 
as digestive aids. 

[0003] The polynucleotides and polypeptides of the present invention have been identified as esterases as a result 
of their enzymatic activity. 

[0004] In accordance with one aspect of the present invention, there are provided novel enzymes, as well as active 
20 fragments, analogs and derivatives thereof. 

[0005] In accordance with another aspect of the present invention, there are provided isolated nucleic acid molecules 
encoding the enzymes of the present invention including mRNAs, cDNAs, genomic DNAs as well as active analogs 
and fragments of such enzymes. 

[0006] In accordance with yet a further aspect of the present invention, there is provided a process for producing 
25 such polypeptides by recombinant techniques comprising culturing recombinant prokaryotic and/or eukaryotic host 
cells, containing a nucleic acid sequence of the present invention, under conditions promoting expression of said en- 
zymes and subsequent recovery of said enzymes. 

[0007] In accordance with yet a further aspect of the present invention, there is provided a process for utilizing such 
enzymes, or polynucleotides encoding such enzymes for hydrolyzing ester groups to yield an organic acid and an 
30 alcohol. The esterases of the invention are stable at high temperatures and in organic solvents and, thus, are superior 
for use in production of optically pure chiral compounds used in pharmaceutical, agricultural and other chemical indus- 
tries. 

[0008] In accordance with yet a further aspect of the present invention, there are also provided nucleic acid probes 
comprising nucleic acid molecules of sufficient length to hybridize to a nucleic acid sequence of the present invention. 
35 [0009] In accordance with yet a further aspect of the present invention, there is provided a process for utilizing such 
enzymes, or polynucleotides encoding such enzymes, for in vitro purposes related to scientific research, for example, 
to generate probes for identifying similar sequences which might encode similar enzymes from other organisms by 
using certain regions, i.e., conserved sequence regions, of the nucleotide sequence. 

[0010] These and other aspects of the present invention should be apparent to those skilled in the art from the 
40 teachings herein. 

[001 1] The following drawings are illustrative of embodiments of the invention and are not meant to limit the scope 
of the invention as encompassed by the claims. 

[0012] Figure 1 is an illustration of the full-length DNA (SEQ ID NO:23) and corresponding deduced amino acid 
sequence (SEQ ID NO:33) of Staphyfothermus marinus F1 -1 2LC of the present invention. Sequencing was performed 
45 using a 378 automated DNA sequencer (Applied Biosystems, Inc.) for all sequences of the present invention. 

[0013] Figure 2 is an illustration of the full-length DNA (SEQ ID NO:24) and corresponding deduced amino acid 
sequence (SEQ ID NO:34) of Pyrodictium TAG 11-17LC. 

[0014] Figure 3 is an illustration of the full-length DNA (SEQ ID NO:25) and corresponding deduced amino acid 
sequence (SEQ ID NO:35) of Archaeoglobus venificus SNP6-24LC. 
so [0015] Figure 4 is an illustration of the full-length DNA (SEQ ID NO:26) and corresponding deduced amino acid 
sequence (SEQ ID NO:36) of Aquifex pyrophilus-2QLC. 

[0016] Figure 5 is an illustration of the full-length DNA (SEQ ID NO:27) and corresponding deduced amino acid 
sequence (SEQ ID NO:37) of M11TL-29L. 

[0017] Figure 6 is an illustration of the full-length DNA (SEQ ID NO:28) and corresponding deduced amino acid 
55 sequence (SEQ ID NO:38) of Thermococcus CL-2-30LC. 

[0018] Figure 7 is an illustration of the full-length DNA (SEQ ID NO:29) and corresponding deduced amino acid 
sequence (SEQ ID NO:39) of Aquifex VF5-34LC. 

[0019] Figure 8 is an illustration of the full-length DNA (SEQ ID NO:30) and corresponding deduced amino acid 
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sequence (SEQ ID NO:40) of Teredinibacter~42L. 

[0020] Figure 9 is an illustration of the full-length DNA (SEQ ID NO:31) and corresponding deduced amino acid 
sequence (SEQ ID NO:41) of Archaeoglobus fulgidus VC16-16MC. 

[0021] Figure 10 is an illustration of the full-length DNA (SEQ ID NO:32) and corresponding deduced amino acid 
s sequence (SEQ ID NO:42) of Sulfolobus solfataricus P1 -8LC. 

[0022] Figure 11 is an illustration of the full-length DNA (SEQ ID NO:43) and corresponding deduced amino acid 
sequence (SEQ ID NO:53) of LA11.1 Esterase es2 of the present invention. 

[0023] Figure 12 is an illustration of the full-length DNA (SEQ ID NO:44) and corresponding deduced amino acid 
sequence (SEQ ID NO:54) of Whale Mat Sample 11.801 Esterase es9. 
w [0024] Figure 13 is an illustration of the full-length DNA (SEQ ID NO:45) and corresponding deduced amino acid 
sequence (SEQ ID NO:55) of Metallosphaera Prunae Ron 12/2 Esterase 23mc1 . 

[0025] Figure 14 is an illustration of the full-length DNA (SEQ ID NO:46) and corresponding deduced amino acid 
sequence (SEQ ID NO:56) of Thermotoga. neapolitana 5068 Esterase 56mc4. 

[0026] Figure 15 is an illustration of the full-length DNA (SEQ ID NO:47) and corresponding deduced amino acid 
is sequence (SEQ ID NO:57) of Mefittangium lichenicola Esterase 77mc1 . 

[0027] Figure 16 is an illustration of the full-length DNA (SEQ ID NO:48) and corresponding deduced amino acid 
sequence (SEQ ID NO:58) of Whale Mat Sample 11.801 Esterase es2. 

[0028] Figure 17 is an illustration of the full-length DNA (SEQ ID NO:49) and corresponding deduced amino acid 
sequence (SEQ ID NO:59) of Whale Mat Sample AD3059 Esterase es4. 
20 [0029] Figure 18 is an illustration of the full-length DNA (SEQ ID NO:50) and corresponding deduced amino acid 
sequence (SEQ ID NO:60) of Microscilla furvescens Esterase 53sc2. 

[0030] figure 19 is an illustration of the full-length DNA (SEQ ID NO:51) and corresponding deduced amino acid 
sequence (SEQ ID NO:61) of Thermotoga maritima MSB8 Esterase 6sc1 . 

[0031] Figure 20 is an illustration of the full-length DNA (SEQ ID NO:52) and corresponding deduced amino acid 
25 sequence (SEQ ID NO:62) of Polyangium brachysporum Esterase 78mc1 . 

[0032] The term "gene" means the segment of DNA involved in producing a polypeptide chain; it includes regions 
preceding and following the coding region (leader and trailer) as well as intervening sequences (introns) between 
individual coding segments (exons). 

[0033] A coding sequence is "operably linked to" another coding sequence when RNA polymerase will transcribe 
30 the two coding sequences into a single mRNA, which is then translated into a single polypeptide having amino acids 
derived from both coding sequences. The coding sequences need not be contiguous to one another so long as the 
expressed sequences ultimately process to produce the desired protein. 

[0034] 'Recombinant" enzymes refer to enzymes produced by recombinant DNA techniques; i.e., produced from 
cells transformed by an exogenous DNA construct encoding the desired enzyme. "Synthetic" enzymes are those pre- 
ss pared by chemical synthesis. 

[0035] A DNA "coding sequence of or a "nucleotide sequence encoding" a particular enzyme, is a DNA sequence 
which is transcribed and translated into an enzyme when placed under the control of appropriate regulatory sequences. 
[0036] In accordance with an aspect of the present invention, there are provided isolated nucleic acids (polynucle- 
otides) which encode for the mature enzymes having the deduced amino acid sequences of Figures 1-20 (SEQ ID 
40 NOS:23-32 and 43 to 52.) 

[0037] The deposit(s) have been made under the terms of the Budapest Treaty on the International Recognition of 
the deposit of micro-organisms for purposes of patent procedure. The strains will be irrevocably and without restriction 
or condition released to the public upon the issuance of a patent. These deposits are provided merely as convenience 
to those of skill in the art and are not an admission that a deposit would be required under 35 U.S.C. §112. The 
45 sequences of the polynucleotides contained in the deposited materials, as well as the amino acid sequences of the 
polypeptides encoded thereby, are controlling in the event of any conflict with any description of sequences herein. A 
license may be required to make, use or sell the deposited materials, and no such license is hereby granted. 
[0038] The polynucleotides of this invention were originally recovered from genomic gene libraries derived from the 
following organisms: 

so [0039] Staphylothermus marinus F1 is a thermophilic sulfur archaea which was isolated in Vulcano, Italy. It grows 
optimally at 85°C (T^ = 98°C) at pH 6.5. 

[0040] Pyrodictium TAG 11 is a thermophilic sulfur archaea which was isolated in the Middle Atlantic Ridge. It grows 
optimally at 103°C (T max = 110°C) at pH 6.5. 

[0041] Archaeoglobus venificus SNP6 was isolated in the Middle Atlantic Ridge and grows optimally at 75°C (T max 
55 = 92°C) at pH 6.9. 

[0042] Aquifex pyrophitus K01 5a was isolated at Kolbeinsey Ridge, North of Iceland. This marine organism is a 
gram-negative, rod-shaped, strictly chemolithoautrophic, knall gas bacterium. It grows optimally at 85°C {Tm^ - 95°C) 
at pH 6.8. 


4 


EP 0 880 590 B1 


[0043] M11TL is a new species of Desulfurococcus which was isolated from Diamond Pool (formerly Jim's Black 
Pool) in Yellowstone. The organism grows heterotrophically by fermentation of different organic materials (sulfur is not 
necessary) in grape-like aggregates optimally at 85 - 88°C in a low salt medium at pH 7.0. 

[0044] Thermococcus CL-2 was isolated in the North Cleft Segment of the Juan de Fuca Ridge from a severed 
5 atvinellid worm residing on a 'black smoker" sulfide structure. This marine archaea forms pleomorphic cocci, and grows 
optimally at 88°C. 

[0045] AquifexVFS was isolated at a beach in Vulcano, Italy. This marine organism is a gram-negative, rod-shaped, 
strictly chemolithoautotrophic, knall gas bacterium. It grows optimally at 85°C (T max = 95°C) at pH 6.8. 
[0046] Teredinibacter (pure) is an endosymbiont of the shipworm Bankia gouldi. The organism has straight to slightly 
10 bent 5-10 u.m rods, and forms spiral cells as stationary phase is met. The organism was described in Science (1983) 
22:1401-1403. It grows optimally at 30°C at pH 8.0. 

[0047] Archaeoglobus fulgidus VC\6 was isolated in Vulcano, Italy. The organism grows optimally at 85°C (T max = 
92°C) at pH 7.0. 

[0048] Sutfofobus solfataricus P1 grows optimally at 85°C (T max = 87°C) at pH 2.0. 

15 [0049] Accordingly, the polynucleotides and enzymes encoded thereby are identified by the organism from which 
they were isolated, and are sometimes hereinafter referred to as F1/12LC (Figure 1 and SEQ ID NOS:23 and 33), 
TA611/17LC (Figure 2 and SEQ ID NOS:24 and 34), SNP6/24LC (Figure 3 and SEQ ID NOS:25 and 35), AqP/28LC 
(Figure 4 and SEQ ID NOS:26 and 36), M11TL/29L (Figure 5 and SEQ ID NOS:27 and 37), CL-2/30LC (Figure 6 and 
SEQ ID NOS:28 and 38), VF5/34LC (Figure 7 and SEQ ID NOS:29 and 39), Trb/42L (Figure 8 and SEQ ID NOS:30 

20 and 40). VC16/16MC (Figure 9 and SEQ ID NOS:31 and 41) and P1/8LC (Figure 10 and SEQ ID NOS: 32 and 42), 
[0050] The polynucleotides and polypeptides of the present invention show identity at the nucleotide and protein 
level to known genes and proteins encoded thereby as shown in Table 1 . 


Table 1 


Enzyme 

Gene w/closest Homology 
(Organism) 

Protein Similarity 
(%) 

Protein Identity 
(%) 

DNA Identity 
(%) 

F1/12LC 

No significant homology 




TAG11/17LC 

No significant homology 




SNP6/24LC 

PIR S34609 - 
carboxylesterase 
Pseudomonessp. (strain KWI- 
56) open reading frame of 
unknown function in E.colL 

46 

27 

42 

AqP/29l_C 


53 

31 

38 

M11TL729LC 

No significant homology 




CL02/30LC 

No significant homology 




VF5/34LC 

Identified by homology to 
28LC; also homologous to 
ORF of unknown function 5' of 
tgs in E. coli 

84 

71 

71 

Trb/42L 

No significant homology 




P1-8LC 





VC16-16MC 






so [0051] All the clones identified in Table 1 encode polypeptides which have esterase activity. 

[0052] This invention, in addition to the isolated nucleic acid molecules encoding the enzymes of the present inven- 
tion, also provides substantially similar sequences. Isolated nucleic acid sequences are substantially similar if: (i) they 
are capable of hybridizing under conditions hereinafter described, to the polynucleotides of SEQ ID NOS:23-32 and 
43-52; (ii) or they encode DNA sequences which are degenerate to the polynucleotides of SEQ ID NOS:23-32 and 43 

55 to 52 Degenerate DNA sequences encode the amino acid sequences of SEQ ID NOS:33-42 and 53 to 62, but have 
variations in the nucleotide coding sequences. As used herein, substantially similar refers to the sequences having 
similar identity to the sequences of the instant invention. The nucleotide sequences that are substantially the same 
can be identified by hybridization or by sequence comparison. Enzyme sequences that are substantially the same can 
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be identified by one or more of the following: proteolytic digestion, gel electrophoresis and/or microsequencing. 
[0053] One means for isolating the nucleic acid molecules encoding the enzymes of the present invention is to probe 
a gene library with a natural or artificially designed probe using art recognized procedures (see, for example: Current 
Protocols in Molecular Biology, Ausubel F.M. et ai (EDS.) Green Publishing Company Assoc. and John Wiley Inter- 
s science, New York, 1989, 1992). It is appreciated by one skilled in the art that the polynucleotides of SEQ ID NOS: 
23-32 and 43 to 52 or fragments thereof (comprising at least 1 2 contiguous nucleotides), are particularly useful probes. 
Other particularly useful probes for this purpose are hybridizable fragments of the sequences of SEQ ID NOS:1-22 (/. 
e., comprising at least 12 contiguous nucleotides). 

[0054] With respect to nucleic acid sequences which hybridize to specific nucleic acid sequences disclosed herein, 
10 hybridization may be carried out under conditions of reduced stringency, medium stringency or even stringent condi- 
tions. As an example of oligonucleotide hybridization, a polymer membrane containing immobilized denatured nucleic 
acids is first prehybridized for 30 minutes at 45°C in a solution consisting of 0:9 M NaCI, 50 mM NaH 2 P0 4 , pH 7.0. 5.0 
mM Na 2 EDTA, 0.5% SDS, 10X Denhardt's, and 0.5 mg/mL polyriboadenylic acid. Approximately 2 X 10 7 cpm (specific 
activity 4-9 X 10 8 cpm/ug) of 32 P end-labeled oligonucleotide probe are then added to the solution. After 12-16 hours 
15 of incubation, the membrane is washed for 30 minutes at room temperature in IX SET (150 mM NaCI, 20 mM Tris 
hydrochloride, pH 7.8, 1 mM N^EDTA) containing 0.5% SDS. followed by a 30 minute wash in fresh 1X SET at Tm 
-10°C for the oligo-nucleotide probe. The membrane is then exposed to autoradiographic film for detection of hybrid- 
ization signals. 

[0055] Stringent conditions means hybridization will occur only if there is at least 90% identity, preferably at least 
20 95% identity and most preferably at least 97% identity between the sequences. See J. Sambrook et at., Molecular 
Cloning, A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory (1 989) which is hereby incorporated by reference 
in its entirety. 

[0056] As used herein, a first DNA (RNA) sequence is at least 70% and preferably at least 80% identical to another 
DNA (RNA) sequence if there is at least 70% and preferably at lest a 80% or 90% identity, respectively, between the 
25 bases of the first sequence and the bases of the another sequence, when property aligned with each other, for example 
when aligned by BLAST N. 

[0057] The present invention relates to polynucleotides which differ from the reference polynucleotide such that the 
changes are silent changes, for example the change do not alter the amino acid sequence encoded by the polynucle- 
otide. The present invention also relates to nucleotide changes which result in amino acid substitutions, additions, 
30 deletions, fusions and truncations in the polypeptide encoded by the reference polynucleotide. In a preferred aspect 
of the invention these polypeptides retain the same biological action as the polypeptide encoded by the reference 
polynucleotide. 

[0058] The polynucleotides of this invention were recovered from genomic gene libraries from the organisms listed 
in Table 1. Gene libraries were generated in the Lambda ZAP II cloning vector (Stratagene Cloning Systems). Mass 
35 excisions were performed on these libraries to generate libraries in the pBluescript phagemid. Libraries were generated 
and excisions were performed according to the protocols/methods hereinafter described. 

[0059] The polynucleotides of the present invention may be in the form of RNA or DNA which DNA includes cDNA, 
genomic DNA, and synthetic DNA. The DNA may be double-stranded or single-stranded, and if single stranded may 
be the coding strand or non-coding (anti-sense) strand. The coding sequences which encodes the mature enzymes 

40 may be identical to the coding sequences shown in Figures 1 -20 (SEQ ID NOS:23-32) and 43 to 52 or may be a different 
coding sequence which coding sequence, as a result of the redundancy or degeneracy of the genetic code, and 43-52, 
encodes the same mature enzymes as the DNA of Figures 1-20 (SEQ ID NOS:23-32 and 43-52). 
[0060] The polynucleotide which encodes for the mature enzyme of Figures 1-20 (SEQ ID NOS:33-42) and 53-62 
may include, but is not limited to: only the coding sequence for the mature enzyme; the coding sequence for the mature 

45 enzyme and additional coding sequence such as a leader sequence or a proprotein sequence; the coding sequence 
for the mature enzyme (and optionally additional coding sequence) and non-coding sequence, such as introns or non- 
coding sequence 5' and/or 3' of the coding sequence for the mature enzyme. 

[0061] Thus, the term 'polynucleotide encoding an enzyme (protein)" encompasses a polynucleotide which includes 
only coding sequence for the enzyme as well as a polynucleotide which includes additional coding and/or non-coding 
50 sequence. 

[0062] The present invention further relates to variants of the hereinabove described polynucleotides which encode 
for fragments, analogs and derivatives of the enzymes having the deduced amino acid sequences of Figures 1-20 
(SEQ ID NOS:33-42 and 53-62). The variant of the polynucleotide may be a naturally occurring allelic variant of the 
polynucleotide or a non-naturally occurring variant of the polynucleotide. 
55 [0063] Thus, the present invention includes polynucleotides encoding the same mature enzymes as shown in Figures 
1*20 (SEQ ID NOS:23-32 and 43-52) as well as variants of such polynucleotides which variants encode for a fragment, 
derivative or analog of the enzymes of Figures 1-20 (SEQ ID NOS:23-32 and 43 to 52). Such nucleotide variants 
include deletion variants, substitution variants and addition or insertion variants. 
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[0064] As hereinabove indicated, the polynucleotides may have a coding sequence which is a naturally occurring 
allelic variant of the coding sequences shown in Figures 1-20 (SEQ ID NOS:23-32 and 43 to 52). As known in the art, 
an allelic variant is an alternate form of a polynucleotide sequence which may have a substitution, deletion or addition 
of one or more nucleotides, which does not substantially alter the function of the encoded enzyme. 

5 [0065] Fragments of the full length gene of the present invention may be used as hybridization probes for a cDNA 
or a genomic library to isolate the full length DNA and to isolate other DNAs which have a high sequence similarity to 
the gene or similar biological activity. Probes of this type preferably have at least 10, preferably at least 15, and even 
more preferably at least 30 bases and may contain, for example, at least 50 or more bases. The probe may also be 
used to identify a DNA clone corresponding to a full length transcript and a genomic clone or clones that contain the 

10 complete gene including regulatory and promoter regions, exons and introns. An example of a screen comprises iso- 
lating the coding region of the gene by using the known DNA sequence to synthesize an oligonucleotide probe. Labeled 
oligonucleotides having a sequence complementary to that of the gene of the present invention are used to screen a 
library of genomic DNA to determine which members of the library the probe hybridizes to. 

[0066] It is also appreciated that such probes can be and are preferably labeled with an analytically detectable reagent 

15 to facilitate identification of the probe. Useful reagents include but are not limited to radioactivity, fluorescent dyes or 
enzymes capable of catalyzing the formation of a detectable product. The probes are thus useful to isolate comple- 
mentary copies of DNA from other sources or to screen such sources for related sequences. 
[0067] The present invention further relates to polynucleotides or fragments thereof encoding an enzyme having 
esterase activity and which hybridize to the hereinabove-described sequences if there is at least 70%, preferably at 

20 (east 90%, and more preferably at least 95% identity between the sequences. The present invention particularly relates 
to polynucleotides which hybridize under stringent conditions to the hereinabove-described polynucleotides. As herein 
used, the term "stringent conditions' means hybridization will occur only if there is at least 95 % and preferably at least 
97 % identity between the sequences. The polynucleotides which hybridize to the hereinabove described polynucle- 
otides in a preferred embodiment encode enzymes which either retain substantially the same biological function or 

25 activity as the mature enzyme encoded by the DNA of Figures 1-20 (SEQ ID NOS:23-32 and 43 to 52.) 

[0068] Alternatively, the polynucleotide may have at least 1 5 bases, preferably at least 30 bases, and more preferably 
at least 50 bases of a polynucleotide of the present invention and which has an identity thereto, as hereinabove de- 
scribed, and which may or may not retain activity. For example, such polynucleotides may be employed as probes for 
the polynucleotides of SEQ ID NOS:23-32, and 43 to 52 for example, for recovery of the polynucleotide or as a diag- 

30 nostic probe or as a PC R primer. 

[0069] Thus, the present invention is directed to polynucleotides or fragments thereof encoding an enzyme having 
esterase activity and having at least a 70% identity, preferably at least 90% identity and more preferably at least a 95% 
identity to a polynucleotide which encodes the enzymes of SEQ ID NOS:33-42 and 53 to 62 as well as fragments 
thereof, which fragments have at least 15 bases, preferably at least 30 bases and most preferably at least 50 bases. 

35 which fragments are at least 90% identical, preferably at least 95 % identical and most preferably at least 97% identical 
under stringent conditions to any portion of a polynucleotide of the present invention. 

[0070] The present invention further relates to enzymes which have the deduced amino acid sequences of Figures 
1-20 (SEQ ID NOS:23-32 and 43 to 52) as well as fragments, analogs and derivatives of such enzyme. 
[0071] The terms "fragment," "derivative" and "analog" when referring to the enzymes of Figures 1-20 (SEQ ID NOS: 
to 33-42 and 53 to 62) mean enzymes which retain essentially the same biological function or activity as such enzymes. 
Thus, an analog includes a proprotein which can be activated by cleavage of the proprotein portion to produce an 
active mature enzyme. 

[0072] The enzymes of the present invention may be a recombinant enzyme, a natural enzyme or a synthetic enzyme, 
preferably a recombinant enzyme. 

45 [0073] The fragment, derivative or analog of the enzymes of Figures 1-20 (SEQ ID NOS:33-42 and 53 to 62) may 
be (i) one in which one or more of the amino acid residues are substituted with a conserved or non-conserved amino 
acid residue (preferably a conserved amino acid residue) and such substituted amino acid residue may or may not be 
one encoded by the genetic code, or (ii) one in which one or more of the amino acid residues includes a substituent 
group, or (iii) one in which the mature enzyme is fused with another compound, such as a compound to increase the 

so half-life of the enzyme (for example, polyethylene glycol), or (iv) one in which the additional amino acids are fused to 
the mature enzyme, such as a leader or secretory sequence or a sequence which is employed for purification of the 
mature enzyme or a proprotein sequence. Such fragments, derivatives and analogs are deemed to be within the scope 
of those skilled in the art from the teachings herein. 

[0074] The enzymes and polynucleotides of the present invention are preferably provided in an isolated form, and 
55 preferably are purified to homogeneity. 

[0075] The term "isolated" means that the material is removed from its original environment (e.g., the natural envi- 
ronment if it is naturally occurring). For example, a naturally-occurring polynucleotide or enzyme present in a Irving 
animal is not isolated, but the same polynucleotide or enzyme, separated from some or all of the coexisting materials 
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in the natural system, is isolated. Such polynucleotides could be part of a vector and/or such polynucleotides or enzymes 
could be part of a composition, and still be isolated in that such vector or composition is not part of its natural environ- 
ment. 

[0076] The enzymes of the present invention include the enzymes of SEQ ID NOS:33-42 and 53 to 62 (in particular 
5 the mature enzyme) as well as enzymes which have at least 70 % similarity (preferably at least 70% identity) to the 
enzymes of SEQ ID NOS:33-42 and 53 to 62 and more preferably at least 90% similarity (more preferably at least 90% 
identity) to the enzymes of SEQ ID NOS:33-42 and 53 to 62 and still more preferably at least 95% similarity (still more 
preferably at least 95% identity) to the enzymes of SEQ ID NOS:33-42 and 53 to 62 and also include portions of such 
enzymes with such portion of the enzyme generally containing at least 30 amino acids and more preferably at least 
10 50 amino acids. 

[0077] As known in the art "similarity" between two enzymes is determined by comparing the amino acid sequence 
and its conserved amino acid substitutes of one enzyme to the sequence of a second enzyme. 
[0078] A variant, Le. a "fragment", "analog" or "derivative" polypeptide, and reference polypeptide may differ in amino 
acid sequence by one or more substitutions, additions, deletion^ fusions and truncations, which may be present in 
15 any combination. 

[0079] Among preferred variants are those that vary from a reference by conservative amino acid substitutions. Such 
substitutions are those that substitute a given amino acid in a polypeptide by another amino acid of like characteristics. 
Typically seen as conservative substitutions are the replacements, one for another, among the aliphatic amino acids 
Ala, Val, Leu and He; interchange of the hydroxyl residues Ser and Thr, exchange of the acidic residues Asp and Glu, 
2b substitution between the amide residues Asn and Gin, exchange of the basic residues Lys and Arg and replacements 
among the aromatic residues Phe, Tyr. 

[0080] Most highly preferred are variants which retain the same biological function and activity as the reference 
polypeptide from which it varies. 

[0081] Fragments or portions of the enzymes of the present invention may be employed for producing the corre- 
25 sponding full-length enzyme by peptide synthesis; therefore, the fragments may be employed as intermediates for 
producing the full-length enzymes. Fragments or portions of the polynucleotides of the present invention may be used 
to synthesize full-length polynucleotides of the present invention. 

[0082] The present invention also relates to vectors which include polynucleotides of the present invention, host cells 
which are genetically engineered with vectors of the invention and the production of enzymes of the invention by 

30 recombinant techniques. 

[0083] Host cells are genetically engineered (transduced or transformed or transfected) with the vectors of this in- 
vention which may be, for example, a cloning vector or an expression vector. The vector may be, for example, in the 
form of a plasmid, a viral particle, a phage, etc. The engineered host cells can be cultured in conventional nutrient 
media modified as appropriate for activating promoters, selecting transformants or amplifying the genes of the present 

35 invention. The culture conditions, such as temperature, pH and the like, are those previously used with the host cell 
selected for expression, and will be apparent to the ordinarily skilled artisan. 

[0084] The polynucleotides of the present invention may be employed for producing enzymes by recombinant tech- 
niques. Thus, for example, the polynucleotide may be included in any one of a variety of expression vectors for ex- 
pressing an enzyme. Such vectors include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., de- 
40 rivatives of SV40; bacterial plasm ids; phage DNA; baculo virus; yeast plasmids; vectors derived from combinations of 
plasm ids and phage DNA, viral DNA such as vaccinia, adenovirus, fowl pox virus, and pseudorabies. However, any 
other vector may be used as long as it is replicable and viable in the host. 

[0085] The appropriate DNA sequence may be inserted into the vector by a variety of procedures. In general, the 
DNA sequence is inserted into an appropriate restriction endonuclease site(s) by procedures known in the art. Such 

45 procedures and others are deemed to be within the scope of those skilled in the art. 

[0086] The DNA sequence in the expression vector is operatively linked to an appropriate expression control se- 
quence(s) (promoter) to direct mRNA synthesis. As representative examples of such promoters, there may be men- 
tioned: LTR or SV40 promoter, the E. coli. lac or trp, the phage lambda P L promoter and other promoters known to 
control expression of genes in prokaryotic or eukaryotic cells or their viruses. The expression vector also contains a 

50 ribosome binding site for translation initiation and a transcription terminator. The vector may also include appropriate 
sequences for amplifying expression. 

[0087] In addition, the expression vectors preferably contain one or more selectable marker genes to provide a 
phenotypic trait for selection of transformed host cells such as dihydrofolate reductase or neomycin resistance for 
eukaryotic cell culture, or such as tetracycline or ampicillin resistance in E. coli. 
55 [0088] The vector containing the appropriate DNA sequence as hereinabove described, as well as an appropriate 
promoter or control sequence, may be employed to transform an appropriate host to permit the host to express the 
protein. 

[0089] As representative examples of appropriate hosts, there may be mentioned: bacterial cells, such as E. coli. 
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Strepromyces, Bacillus subtitis; fungal cells, such as yeast; insect cells such as Drosophila S2 and Spodoptera Sf9; 
animal cells such as CHO, COS or Bowes melanoma: adenoviruses; plant cells, etc. The selection of an appropriate 
host is deemed to be within the scope of those skilled in the art from the teachings herein. 

[0090] More particularly, the present invention also includes recombinant constructs comprising one or more of the 
5 sequences as broadly described above. The constructs comprise a vector, such as a plasmid or viral vector, into which 
a sequence of the invention has been inserted, in a forward or reverse orientation. In a preferred aspect of this em- 
bodiment, the construct further comprises regulatory sequences, including, for example, a promoter, operably linked 
to the sequence. Large numbers of suitable vectors and promoters are known to those of skill in the art, and are 
commercially available. The following vectors are provided by way of example; Bacterial: pQE70, pQE60, pQE-9 (Qia- 
10 gen), pBIuescript II KS, ptrc99a, pKK223-3, pDR540, pRIT2T (Pharmacia); Eukaryotic: pXT1, pSG5 (Stratagene) 
pSVK3, pBPV, pMSG, pSVL, SV40 (Pharmacia). However, any other plasmid or vector may be used as long as they 
are replicable and viable in the host. 

[0091] Promoter regions can be selected from any desired gene using CAT (chloramphenicol transferase) vectors 
or other vectors with selectable markers. Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial 
15 promoters include lacl, lacZ, T3, T7, gpt. lambda P R , P L and trp. Eukaryotic promoters include CMV immediate early, 
HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-l. Selection of the ap- 
propriate vector and promoter is well within the level of ordinary skill in the art. 

[0092] In a further embodiment, the present invention relates to host cells containing the above-described constructs. 
The host cell can be a higher eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such as a yeast 
20 cell, or the host cell can be a prokaryotic cell, such as a bacterial cell. Introduction of the construct into the host cell 
can be effected by calcium phosphate transfection, DEAE-Dextran mediated transfection, or electroporation (Davis, 
L, Dibner, M., Battey, I., Basic Methods in Molecular Biology, (1986)). 

[0093] The constructs in host cells can be used in a conventional manner to produce the gene product encoded by 
the recombinant sequence. Alternative ly, the enzymes of the invention can be synthetically produced by conventional 
25 peptide synthesizers. 

[0094] Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under the control of 
appropriate promoters. Cell-free translation systems can also be employed to produce such proteins using RNAs de- 
rived from the DN A constructs of the present invention. Appropriate cloning and expression vectors for use with prokary- 
otic and eukaryotic hosts are described by Sambrook era/., Molecular Cloning: A Laboratory Manual, Second Edition, 

30 Cold Spring Harbor, N.Y., (1989), the disclosure of which is hereby incorporated by reference. 

[0095] Transcription of the DNA encoding the enzymes of the present invention by higher eukaryotes is increased 
by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10 
to 300 bp that act on a promoter to increase its transcription. Examples include the SV40 enhancer on the late side of 
the replication origin bp 100 to 270, a cytomegalovirus early promoter enhancer, the polyoma enhancer on the late 

35 side of the replication origin, and adenovirus enhancers. 

[0096] Generally, recombinant expression vectors will include origins of replication and selectable markers permitting 
transformation of the host cell, e.g., the ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and a 
promoter derived from a highly-expressed gene to direct transcription of a downstream structural sequence. Such 
promoters can be derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK). a- 
factor, acid phosphatase, or heat shock proteins, among others. The heterologous structural sequence is assembled 
in appropriate phase with translation initiation and termination sequences, and preferably, a leader sequence capable 
of directing secretion of translated enzyme. Optionally, the heterologous sequence can encode a fusion enzyme in- 
cluding an N-terminal identification peptide imparting desired characteristics, e.g., stabilization or simplified purification 
of expressed recombinant product. 

45 [0097] Useful expression vectors for bacterial use are constructed by inserting a structural DNA sequence encoding 
a desired protein together with suitable translation initiation and termination signals in operable reading phase with a 
functional promoter. The vector will comprise one or more phenotypic selectable markers and an origin of replication 
to ensure maintenance of the vector and to, if desirable, provide amplification within the host. Suitable prokaryotic 
hosts for transformation include E. coli, Bacillus subtilis, Salmonella typhimuriumand various species within the genera 

50 Pseudomonas, Streptomyces, and Staphylococcus, although others may also be employed as a matter of choice. 
[0098] As a representative but nonlimiting example, useful expression vectors for bacterial use can comprise a se- 
lectable marker and bacterial origin of replication derived from commercially available plasm ids comprising genetic 
elements of the well known cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example, 
pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec, Madison, Wl, USA). These 

55 pBR322 "backbone" sections are combined with an appropriate promoter and the structural sequence to be expressed. 
[0099] Following transformation of a suitable host strain and growth of the host strain to an appropriate cell density, 
the selected promoter is induced by appropriate means {e.g., temperature shift or chemical induction) and cells are 
cultured for an additional period. 
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[0100] Cells are typically harvested by centrifugation, disrupted by physical or chemical means, and the resulting 
crude extract retained for further purification. 

[0101] Microbial cells employed in expression of proteins can be disrupted by any convenient method, including 
freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents, such methods are well known to 
5 those skilled in the art. 

[01 02] Various mammalian cell culture systems can also be employed to express recombinant protein. Examples of 
mammalian expression systems include the COS-7 lines of monkey kidney fibroblasts, described by Gluzman, Cell, 
23: 175 (1 981 ), and other cell lines capable of expressing a compatible vector, for example, the C127, 3T3, CHO, HeLa 
and BHK cell lines. Mammalian expression vectors will comprise an origin of replication, a suitable promoter and en- 
10 hancer, and also any necessary ribosome binding sites, polyadenyiation site, splice donor and acceptor sites, tran- 
scriptional termination sequences, and 5' flanking nontranscribed sequences. DNA sequences derived from the SV40 
splice, and polyadenyiation sites may be used to provide the required nontranscribed genetic elements. 
[0103] The enzyme can be recovered and purified from recombinant cell cultures by methods including ammonium 
sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chroma- 
's tography, hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Protein refolding steps can be used, as necessary, in completing configuration of the mature protein. 
Finally, high performance liquid chromatography (HPLC) can be employed for final purification steps. 
[01 04] The enzymes of the present invention may be a naturally purified product, or a product of chemical synthetic 
procedures, or produced by recombinant techniques from a prokaryotic or eukaryotic host (for example, by bacterial, 
20 yeast, higher plant, insect and mammalian cells in culture). Depending upon the host employed in a recombinant 
production procedure, the enzymes of the present invention may be glycosylated or may be non-glycosylated. Enzymes 
of the invention may or may not also include an initial methionine amino acid residue. 

[01 05] Esterases are a group of key enzymes in the metabolism of fats and are found in all organisms from microbes 
to mammals. In the hydrolysis reaction, an ester group is hydrolysed to an organic acid and an alcohol. 

25 [0106] Esterases ehantiomericaily differentiate dicarboxylic diesters and diacetates of diols. Using the approach 
disclosed in a commonly assigned, copending provisional application Serial No. 60/008,316, filed on December 7, 
1995 and entitled "Combinatorial Enzyme Development," the disclosure of which is incorporated herein by reference 
in its entirety, one could convert the enantiospecif icity of the esterase. Further, the thermostable esterases are believed 
to have superior stability at higher temperatures and in organic solvents. Thus, they are better suited for use in rigorous 

30 production procees which require robust catalysts. 

[0107] There are a number of industrial and scientific applications for esterases, such as those of the present inven- 
tion, including: 

1) Esterases are useful in the dairy industry as ripening starters for cheeses, such as the Swiss-type cheeses; 

35 

2) Esterases are useful in the pulp and paper industry for lighin removal from cellulose pulps, for Ngnin solubilization 
by cleaving the ester linkages between aromatic acids and lignin and between lignin and hemicellu loses, and for 
disruption of cell waif structure when used in combinatioft^tr^xylahase and other xylan-degrading enzymes in 
biopulping and biobleaching of pulps; 

40 

3) Esterases are useful in the synthesis of carbohydrate derivatives, such as sugar derivatives; 

4) Esterases are useful, when combined with xylanases and cellulases, in the conversion of lignocellulosic wastes 
to fermentable sugars for producing a variety of chemicals and fuels; 

45 

5) Esterases are useful as research reagents in studies on plant cell wall structure, particularly the nature of cov- 
alent bonds between lignin and carbohydrate polymers in the cell wall matrix; 

6) Esterases are also useful as research reagents in studies on mechanisms related to disease resistance in plants 
so and the process of organic matter decomposition; and 

7) Esterases are useful in selection of plants bred for production of highly digestible animal feeds, particularly for 
ruminant animals. 

55 [0108] Antibodies generated against the enzymes corresponding to a sequence of the present invention can be 
obtained by direct injection of the enzymes into an animal or by administering the enzymes to an animal, preferably a 
nonhuman. The antibody so obtained will then bind the enzymes itself. In this manner, even a sequence encoding only 
a fragment of the enzymes can be used to generate antibodies binding the whole native enzymes. Such antibodies 
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can then be used to isolate the enzyme from cells expressing that enzyme. 

[0109] For preparation of monoclonal antibodies, any technique which provides antibodies produced by continuous 
cell line cultures can be used. Examples include the hybridoma technique (Kohler and Milstein, Nature, 256:495-497, 
1975), the trioma technique, the human B-cell hybridoma technique (Kozbor et al., Immunology Today 4:72, 1983), 
5 and the EBV-hybridoma technique to produce human monoclonal antibodies (Cole et a/., in Monoclonal Antibodies 
and Cancer Therapy Alan R. Liss, Inc., pp. 77-96, 1985). 

[0110] Techniques described for the production of single chain antibodies (U.S. Patent 4.946.778) can be adapted 
to produce single chain antibodies to immunogenic enzyme products of this invention. Also, transgenic mice may be 
used to express humanized antibodies to immunogenic enzyme products of this invention. 

to [0111] Antibodies generated against an enzyme of the present invention may be used in screening for similar en- 
zymes from other organisms and samples. Such screening techniques are known in the art, for example, one such 
screening assay is described in Sambrook et a/., Molecular Cloning: A Laboratory Manual (2d Ed.), Cold Spring Harbor 
Laboratory, Section 12.21-12.28 (1989) which is hereby incorporated by reference in its entirety. 
[01 12] The present invention will be further described with reference to the following examples; however, it is to be 

is understood that the present invention is not limited to such examples. All parts or amounts, unless otherwise specified, 
are by weight. 

[0113] In order to facilitate understanding of the following examples certain frequently occurring methods and/or 
terms will be described. 

[01 14] "Plasmids" are designated by a lower case "p" preceded and/or followed by capital letters and/or numbers. 
20 The starting plasmids herein are either commercially available, publicly available on an unrestricted basis, or can be 
constructed from available plasmids in accord with published procedures. In addition, equivalent plasmids to those 
described are known in the art and will be apparent to the ordinarily skilled artisan. 

[01 15] ■Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction enzyme that acts only at certain 
sequences in the DNA. The various restriction enzymes used herein are commercially available and their reaction 

25 conditions, cofactors and other requirements were used as would be known to the ordinarily skilled artisan. For ana- 
lytical purposes, typically 1 u.g of plasmid or DNA fragment is used with about 2 units of enzyme in about 20 jxl of buffer 
solution. For the purpose of isolating DNA fragments for plasmid construction, typically 5 to 50 fig of DNA are digested 
with 20 to 250 units of enzyme in a larger volume. Appropriate buffers and substrate amounts for particular restriction 
enzymes are specified by the manufacturer. Incubation times of about 1 hour at 37' C are ordinarily used, but may vary 

30 in accordance with the supplier's instructions. After digestion the reaction is electrophoresed directly on a polyacryla- 
mide gel to isolate the desired fragment. 

[01 1 6] Size separation of the cleaved fragments is performed using 8 percent polyacrylamide gel described by Goed- 
del et a/., Nucleic Acids Res., 8:4057 (1980). 

[01 17] "Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two complementary polydeoxy- 
35 nucleotide strands which may be chemically synthesized. Such synthetic oligonucleotides have no 5' phosphate and 
thus will not ligate to another oligonucleotide without adding a phosphate with an ATP in the presence of a kinase. A 
synthetic oiigonucleotide will ligate to a fragment that has not been dephosphorytated. 

[0118] "Ligation" refers to the process of forming phosphodtester bonds between two double stranded nucleic acid 
fragments (Maniatis, T, et a/., Id, p. 146). Unless otherwise provided, ligation may be accomplished using known 
to buffers and conditions with 1 0 units of T4 DNA ligase fligase") per 0.5 ng of approximately equimolar amounts of the 
DNA fragments to be ligated. 

[01 19] Unless otherwise stated, transformation was performed as described in Sambrook et a/., Molecular Cloning: 
A Laboratory Manual (2d Fd.), Cold Spring Harbor Press (1989). 

45 Example 1 

Bacterial Expression and Purification off Esterases 

[0120] DNA encoding the enzymes of the present invention, SEQ ID NOS:33 through 42, were initially amplified from 
50 a pBluescript vector containing the DNA by the PCR technique using the primers noted herein. The amplified sequences 
were then inserted into the respective PQE vector listed beneath the primer sequences, and the enzyme was expressed 
according to the protocols set forth herein. The 5' and 3' primer sequences for the respective genes are as follows: 

Staphylothermus mahnus F1 -1 2LC 

55 
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5' CCGAGAATTC ATTAAAGAGO AGAAATTAAC TATGTCTTTA AACAAGCACT CT 
3' CGOAAGATCT CTATCGTTTA GTGTATGATT T 

vector: pQET 
Pyrodictium TAG 1 1 -1 7LC 

5' CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAAACTC CTTGAGCCCA CA EcoRI 
3* CGGAAGATCT CGCCGGTACA CCATCAGCCA C BglH 

vector: pQET 

Archaeoglobus venificus SNP6-24LC 

5' CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCCATAT GTTAGGAATG GT 

3' CGGAGGTACC TTAGAACTGT GCTGAAGAAA TAAATTCGTC CATTGCTCT 

3' CGGAGGTACC TTAGAACTGT GCTGAAGAAA TAAATTCGTC CATTGCTCTA TTA 

vector: pQET 

Aquifex pyrophilus - 28LC 

3' CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAGATTG AGGAAATTTG AAG 
3' CGGAGGTACC CTATTCAGAA AGTACCTCTA A 

vector: pQET 
M11TL-29LC 

5' CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGTTTAAT ATCAATGTCT TT 
3' CGGAAGATCT TTAAGGATTT TCCCTGGGTA G 

vector: pQET 

Thermococcus CL-2 - 30LC 

5' CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGGAGGTT TACAAGGCCA AA 
3* CGGAGGTACC TTATTGAGCC GAAGACTACG A 

vector: pQET 
/*qu/fexVF5-34LC 
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5' CCCACAATTC ATTAAAGAGG AGAAATTAAC TATGATTGGC AATTTGAAAT TGA EcoRI 
3* CGGAGGTACC TTAAAGTGCT CTCATATCCC C Kpni 

vector pQET 
Teredinibacter 42L 

5' CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCCAGCT AATGACTCAC CC 
3* CGOAAGATCT TCAACAGGCT CCAAATAATT TC (wifbout Hb-tag) 
3' CGGAAGATCT ACAGGCTCCA AATAATTTC (with His^ag) 


vector pQE12 

Archaeoglobus fulgidus VC1 6-1 6MC 

3*" CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCTTGAT ATGCCAATCG AC EcoRI 
3* CGGAGGTACC CTAGTCGAAG ACAAGAAGAG C Kpnl 

vector pQET 

Sutfolabus solfataricus P1-8LC 

5' CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCCCCAG GATCCTAGAA TT EcoRI 
3* CGGAGGTACC TTAAATTTTA TCATAAAATA C KpnJ 

vector: pQET 

35 [0121] The restriction enzyme sites indicated correspond to the restriction enzyme sites on the bacterial expression 
vector indicated for the respective gene (Qiagen, Inc. Chatsworth, CA). The pQE vector encodes antibiotic resistance 
(Ampf), a bacterial origin of replication (ori). an IPTG-regulatable promoter operator (P/O), a ribosome binding site 
(RBS), a 6-His tag and restriction enzyme sites. 

[0122] The pQE vector was digested with the restriction enzymes indicated. The amplified sequences were ligated 
<0 into the respective pQE vector and inserted in frame with the sequence encoding for the RBS. The ligation mixture 
was then used to transform the E. coti strain M1 5/pREP4 (Qiagen, Inc.) by electroporation. M1 5/pREP4 contains mul- 
tiple copies of the plasm id pREP4, which expresses the lacl repressor and also confers kanamycin resistance (KanQ. 
Transformants were identified by their ability to grow on LB plates and ampicillin/kanamycin resistant colonies were 
selected. Plasm id DNA was isolated and confirmed by restriction analysis. Clones containing the desired constructs 
<5 were grown overnight (O/N) in liquid culture in LB media supplemented with both Amp (100 ug/ml) and Kan (25 ug/ 
ml). The O/N culture was used to inoculate a large culture at a ratio of 1:100 to 1:250. The cells were grown to an 
optical density 600 (O.D. 600 ) of between 0.4 and 0.6. IPTG ("Isopropyl-B-D-thiogalacto pyranoside") was then added 
to a final concentration of 1 mM. IPTG induces by inactivating the lacl repressor, clearing the P/O leading to increased 
gene expression. Cells were grown an extra 3 to 4 hours. Cells were then harvested by centrifugation. 
so [0123] The primer sequences set out above may also be employed to isolate the target gene from the deposited 
material by hybridization techniques described above. 

Example 2 

55 Isolation of a Selected Clone from the Deposited Genomic Clones 

[0124] The two oligonucleotide primers corresponding to the gene of interest are used to amplify the gene from the 
deposited material. A polymerase chain reaction is carried out in 25 uJ of reaction mixture with 0.1 u.g of the DNA of 
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the gene of interest. The reaction mixture is 1.5-5 mM MgC^, 0.01 % (w/v) gelatin, 20 uM each of dATP, dCTP, dGTP, 
dTTP. 25 pmol of each primer and 1 .25 Unit of Taq polymerase. Thirty cycles of PCR (denaturation at 94°C for 1 min; 
annealing at 55°C for 1 min; elongation at 72° C for 1 min) are performed with the Perkin-Elmer Cetus 9600 thermal 
cycler. The amplified product is analyzed by agarose gel electrophoresis and the DNA band with expected molecular 
5 weight is excised and purified. The PCR product is verified to be the gene of interest by subcloning and sequencing 
the DNA product. 

Example 3 

10 Production of the Expression Gene Bank 

[0125] Colonies containing pBluescript plasmids with random inserts from the organisms M11TL, Thermococcus 
GU5L5, and Teredinibacterwere obtained according to the method of Hay and Short, Strategies, 5:16, 1992. 

15 Example 4 

Screening for Lipase/Esterase Activity 

[0126] The resulting colonies were picked with sterile toothpicks and used to singly inoculate each of the wells of 
20 96-well microtiter plates. The wells contained 250 u.L of LB media with 1 00 u.g/mL ampicillin, 80 fig/mL methicillin, and 
10% v/v glycerol (LB Amp/Meth. glycerol). The cells were grown overnight at 37°C without shaking. This constituted 
generation of the "Source GeneBank." Each well of the Source GeneBank thus contained a stock culture of E. coli 
cells, each of which contained a pBluescript with a unique DNA insert. 

[0127] The plates of the Source GeneBank were used to multiply inoculate a single plate (the "Condensed Plate") 
25 containing in each well 200 uL of LB Amp/Meth, glycerol. This step was performed using the High Density Replicating 
Tool (HDRT) of the Beckman Biomek with a 1 % bleach, water, isopropanol, air-dry sterilization cycle in between each 
inoculation. Each well of the Condensed Plate thus contained 10 to 12 different pBluescript clones from each of the 
source library plates. The Condensed Plate was grown for 16 hours at 37°C and then used to inoculate two white 
96-well Polyfiltronics microtiter daughter plates containing in each well 250 ul of LB Amp/Meth (no glycerol). The 
30 original condensed plate was put in storage -80° C. The two condensed daughter plates were incubated at 37°C for 
18 hours. 

[0128] The short chain esterase '600 u.M substrate stock solution' was prepared as follows: 25 mg of each of the 
following compounds was dissolved in the appropriate volume of DMSO to yield a 25.2 mM solution. The compounds 
used were 4-methylumbelliferyl proprionoate, 4-methylumbelliferyl butyrate, and 4-methylumbelliferyl heptanoate. Two 
35 hundred fifty microliters of each DMSO solution was added to ca 9 mL of 50 mM, pH 7.5 Hepes buffer which contained 
0.6% of Triton X-100 and 0.6 mg per mL of dodecyl maltoside (Anatrace). The volume was taken to 10.5 mL with the 
above Hepes buffer to yield a slightly cloudy suspension. 

[0129] The long chain '600 u.M substrate stock solution' was prepared as follows: 25 mg of each of the following 
compounds was dissolved in DMSO to 25.2 mM as above. The compounds used were 4-methylumbelliferyl elaidate, 

40 4-methylumbelliferyl palmitate, 4-methylumbelliferyl oleate, and 4-methylumbelliferyl stearate. All required brief warm- 
ing in a 70°C bath to achieve dissolution. Two hundred fifty microliters of each DMSO solution was added to the Hepes 
buffer and diluted to 10.5 mL as above. All seven umbelliferones were obtained from Sigma Chemical Co. 
[0130] Fifty u.L of the long chain esterase or short chain esterase '600 u.M substrate stock solution 1 was added to 
each of the wells of a white condensed plate using the Biomek to yield a final concentration of substrate of about 100 

45 |iM.. The fluorescence values were recorded (excitation = 326 nm, emission = 450 nm) on a plate-reading fluorometer 
immediately after addition of the substrate. The plate was incubated at 70° C for 60 minutes in the case of the long 
chain substrates, and 30 minutes at RT in the case of the short chain substrates. The fluorescence values were recorded 
again. The initial and final fluorescence values were compared to determine if an active clone was present. 

50 Example 5 

Isolation and Purification of the Active Clone 

[0131] To isolate the individual clone which carried the activity, the Source GeneBank plates were thawed and the 
55 individual wells used to singly inoculate a new plate containing LB Amp/Meth. As above, the plate was incubated at 
37°C to grow the cells, 50 jiL of 600 p.M substrate stock solution was added using the Biomek and the fluorescence 
was determined. Once the active well from the source plate was identified, cells from this active well were streaked on 
agar with LB/Amp/Meth and grown overnight at 37°C to obtain single colonies. Eight single colonies were picked with 
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a sterile toothpick and used to singly inoculate the wells of a 96-well microtiter plate. The wells contained 250 \iL of 
LB Amp/Meth. The cells were grown overnight at 37°C without shaking. A 200 pL aliquot was removed from each well 
and assayed with the appropriate long or short chain substrates as above. The most active clone was identified and 
the remaining 50 fiL of culture was used to streak an agar plate with LB/Amp/Meth. Eight single colonies were picked, 
5 grown and assayed as above. The most active clone was used to inoculate 3 ml_ cultures of LB/Amp/Meth, which were 
grown overnight. The plasmid DNA was isolated from the cultures and utilized for sequencing. 

SEQUENCE LISTING 

10 [0132] 

(1) GENERAL INFORMATION: 

(i) APPLICANTS: 

15 

ROBERTSON, Daniel E. 
MURPHY, Dennis 
20 REID, John 

MAFFIA, Anthony 
LINK, Steven 

25 

SWANSON, Ronald V. 
WARREN, Patrick V. 
30 KOSMOTKA, Anna 

CALLEN, Walter 

(ii) TITLE OF INVENTION: 

35 

ESTERASES 

(iii) NUMBER OF SEQUENCES: 62 
40 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: CARELLA. BYRNE, BAIN, GILFILLAN, CECCHI, STEWART & OLSTEIN 

(B) STREET: 6 BECKER FARM ROAD 

(C) CITY: ROSELAND 

45 (D) STATE: NEW JERSEY 

(E) COUNTRY: USA 

(F) ZIP: 07068 

(v) COMPUTER READABLE FORM: 

50 

(A) MEDIUM TYPE: 3.5 INCH DISKETTE 

(B) COMPUTER: IBM PS/2 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: WORD PERFECT 5.1 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: Unassigned 
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(B) FILING DATE: Concurrently 

(C) CLASSIFICATION: Unassigned 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: HERRON, CHARLES J. 

(B) REGISTRATION NUMBER: 28,019 

(C) REFERENCE/DOCKET NUMBER: 331400-39 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201-994-1700 

(B) TELEFAX: 201-994-1744 

(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 


CCGA.GAATTC ATTAAAOA.GG AGAAATTAAC TATGTCTTXA AACAAGCACT CT 


(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 


CGGAAGATCT CTATCGTTTA GTOTATGATT T 


(2) INFORMATION FOR SEQ ID NO:3: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLBOTIDES 

(B) TYPE: NUCLEIC ACID 
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(C) STRANDEDNESS: SINGLE . 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 


CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAAACTC CTTQAOCCCA CA 52 

10 

(2) INFORMATION FOR SEQ ID NO:4: 
(i) SEQUENCE CHARACTERISTICS 

15 (A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

20 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 


CGGAAGATCT CGCCGOTACA CCATCAGCCA C 31 


(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS 

30 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

35 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

40 

CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCCATAT GTTAGGAATG GT 52 

(2) INFORMATION FOR SEQ ID NO:6: 
45 (j) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 53 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 
so (D) TOPOLOGY: LINEAR 

(ii) MOLECUM: TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

55 

CGGAGGTACC TTAGAACTGT GCTGAAGAAA TAAATTCGTC CATTGCTCTA TEA 53 
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(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 49 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 


COGAGOTACC TTAOAACTGT GCTGAAGAAA TAAATTCGTC CATTGCTCT 


(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 53 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 


CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAGATTG AGGAAATTTG AAG 


(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 


CGGAGGTACC CTATTCAGAA AGTACCTCTA A 


(2) INFORMATION FOR SEQ ID NO:10: 
(0 SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

CCGAOAATTC ATTAAAGAGG AGAAATTAAC TATGTTTAAT ATCAATQTCT TT 

(2) INFORMATION FOR SEO ID NO:11: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: 

GGGAAGATCT TTAAOGATIT TCCCTGGGTA G 

(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE; NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

COGAGAATTC ATTAAAGAGG AGAAATTAAC TATGGAGGTT TACAAGGCCA AA 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

CGGAGGTACC ttattgagcc gaagagxacg a 

(2) INFORMATION FOR SEQ ID NO:14: 
(i) SEQUENCE CHARACTERISTICS 
(A) LENGTH: 53 NUCLEOTIDES 
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(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

CCGAGAATTC ATTAAAGAGG AJGAAATTAAC TATGATTGGC AATXTGAAAT TGA 

(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

CGGAGGTACC TTAAAGTGCT CTCATATCCC C 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCCAGCT AATGACTCAC CC 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 32 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

CGGAAGATCT TCAACAGGCT CCAAATAATT TC 
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(2) INFORMATION FOR SEQ ID NO:18: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 29 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

• (ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1 8: 


CGGAAOATCT ACAGGCTCCA AATAATTTC 


(2) INFORMATION FOR SEQ ID NO:1 9: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 


CCGAQAATTC ATTAAAGAGG AGAAATTAAC TATGCTTGAT ATGCCAATCG AC 


(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 


CGGAGGTACC CTAGTCGAAC AGAAGAAGAG C 


(2) INFORMATION FOR SEQ ID NO:21: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCCCCTA QATCCTAOAA TT 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

CGGAGGTACC TTAAATTTTA TCATAAAATA C 

(2) INFORMATION FOR SEO ID NO:23: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 555 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
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ATG TCT TTA AAC AAG CAC TCT TOG ATG GAT ATG ATA ATA TTT ATT CTC 48 
Met: Ser Leu Aan Lys His Ser Trp Net Asp Met He He Phe He lieu 
15 10 15 

5 AGC TTT TCT TTC CCA TTA ACA ATG ATC GCA TTA GCT ATC TCT ATG TCG 9€ 

Ser Phe Ser Phe Pro Leu Thr Met He Ala Leu Ala He ser Met Ser 
20 25 30 

TCA TGG TTT AAT ATA TGG AAT AAT GCA TTA AGC GAT CTA GGA CAT GCT 144 

Ser Trp Pbe Asn He Trp Asn Asn Ala Leu Ser Asp Leu Gly His Ala 
10 35 40 45 

GTT AAA AGC AGT GTT GCT CCA ATA TTC AAT CTA GCT CTT GCA ATT GGT 192 
Val Lys Ser Ser Val Ala Pro He Phe Asn Leu Gly Leu Ala He Gly 
50 55 60 

15 GGG ATA CTA ATT GTT ATA GTT GGT TTA AOA AAT CTT TAT TCG TGG AGT 240 

Gly He Leu He Val He Val Gly Leu Arg Asn Leu Tyr Ser Trp Ser 
SB 70 75 90 

AGA GTT AAA GGA TCT TTA ATC ATA TCC ATG GOT GTA TTT CTT AAC TTA 288 
Arg Val Lys Gly Ser Leu lie He Ser Met Gly Val Phe Leu Asn Leu 
20 85 90 9S 

ATA GGG GTT TTC GAC GAA GTA TAT GGT TGG ATA CAT TTC CIA GTC TCA 336 
He Gly Val Phe Asp Glu Val Tyr Gly Trp He His Phe Leu Val Ser 
100 105 110 

GTA TTG TTT TTC TTA TCA ATA ATA GCA TAT TTC ATA GCT ATA TCA ATA 384 
25 val Leu Phe Phe Leu Ser He He Ala Tyr Phe He Ala He Ser He 
115 120 125 

CTT GAC AAA TCA TGG ATA GCT GTT CTA CTA ATA ATA GGT CAT ATT GCA 432 
Leu Asp Lys Ser Trp He Ala Val Leu Leu He He Gly His He Ala 
130 135 140 

30 

ATG TGG TAT CTA CAC TTT GCT TCA GAG ATT COG AGA GGT GCG GCT ATT 480 
Met Trp Tyr Leu His Phe Ala Ser Glu He Pro Arg Gly Ala Ala He 
145 150 155 160 


CCC GAG TTA TTA GCG GTA TTC TCG TTT TTA CCA TTC TAT ATA AGA GAC 52 e 
Pro Glu Leu Leu Ala Val Phe Ser Phe Leu Pro Phe Tyr He Arg Asp 

165 170 175 

TAT TTT AAA TCA TAC ACT AAA CGA TAG 576 
Tyr Phe Lye Ser Tyr Thr Lys Arg 
180 


45 


(2) INFORMATION FOR SEQ ID NO:24: 
(i) SEQUENCE CHARACTERISTICS 


(A) LENGTH: 1041 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

50 (C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
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10 


ATG AAA. CTC CTT GAG CCC ACA AAT ACC TCC TAC ACG CTG TTA CAO GAT 48 
Met Lys Leu Leu Glu Pro Thr Asn Thr Ser Tyr Hir Leu Leu Gin Asp 
15 10 15 

TTA OCA TTG CAT TTT GCA TTT TAC TGG TTT CTG GCC GTG TAT ACG TOG 
96 

Leu Ala Leu His Phe Ala Phe Tyr Trp Phe Leu Ala Val Tyr Thr Trp 
20 25 30 

TTA CCC GOT GTC CTA GTC CGG GGC GTA GCT GTG GAC ACA GGG GTG GCT X44 
Leu pro Gly Val Leu Val Arg Gly Val Ala Val Asp Thr Gly Val Ala 
35 40 45 

CGG GTG CCT GGG CTC GGC CGG CGC GGT AAG AGO CTG CTC CTG GCC GCT 192 
Arg Val Pro Gly Leu Gly Arg Arg Gly Lys Arg Leu Leu Leu Ala Ala 
50 55 60 

15 GTG GCT GTC TTG GCG CTT GTT GTG TCC GTT GTT GTC CCG GCT TAT GTG 240 

Val Ala Val Leu Ala Leu Val Val Ser Val Val Val Pro Ala Tyr Val 
65 70 75 80 

GCG TAT ACT AST CTG CAC CCG GAG AGC TGT CGG CCC GTT GCG CCG GAG 288 
Ala Tyr Ser Ser Leu His Pro Glu Ser Cys Arg Pro Val Ala Pro Glu 

so ' 85 90 95 

GGG CTC ACC TAC AAA GAG TTC AGC GTG ACC GCG GAG GAT GGC TTG GTG 336 
* Gly Leu Thr Tyr Lys Glu Phe Ser Val Thr Ala Glu Asp Gly Leu Val 
100 105 110 

GTT CGG GGC TGG GTG CTG GGC CCC GGC GCT GGG GGC AAC CCG GTG TTC 384 
25 Val Arg Gly Trp Cal Leu Gly Pro Gly Ala Gly Gly Asn Pro Val Phe 

115 120 125 

GIT TTG ATG CAC GGG TAT ACT GGG TGC CGC TCG GCG CCC TAC ATG GCT 432 
Val Leu Met His Gly Tyr Thr Gly Cys Arg Ser Ala Pro Tyr Met Ala 
130 135 140 

on 

GTG CTG GCC CGG GAG CTC GTG GAG TGG GGG TAC CCG GTG GTT GTG TTC 480 
Val Leu Ala Arg Glu Leu Val Glu Trp Gly Tyr Pro Val Val Val Phe 
145 150 155 160 

GAC TTC CGG GGC CAC GGG GAG AGC GGG GGC TCG ACG ACG ATT GGG CCC 528 
35 Asp Phe Arg Gly His Gly Glu Ser Gly Gly Ser Thr Thr lie Gly Pro 


40 


45 


50 


55 
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16S 170 175 

CGG GAG GTG CM GAT GCC OGG GCT GIG GTG GGC TAT GTC TOG GAG CGG 
Arg Glu Val Leu Asp Ala Arg Ala Val Val Gly Tyr Val Ser Glu Arg 
180 185 190 

TTC CCC GGC CGC CGG ATA ATA TTG GTG GGG TTC ACT ATG GGC GGC GCT 
phe Pro Gly Arg Arg He lie Leu Val Gly Phe Ser Met Gly Gly Ala 

195 200 205 

GTA GCG ATC GTG GAG GOT GCT GGG GAC CCG CGG GTC TAC GOG GTG GCT 
Val Ala He Val Glu Gly Ala Gly Asp Pro Arg Val Tyr Ala Val Ala 
210 215 220 

GCT GAT AGC COG TAC TAT AGO CTC CGG GAC GTC ATA CCC CGG TGG CTG 
Ala Asp Ser Pro Tyr Tyr Arg Leu Arg Asp Val He Pro Arg Trp Leu 
225 230 235 240 

GAG TAC AAG ACG CCG CTG CCG GGC TGG GTG GGT GTG CTG GCC GGG TTC 
Glu Tyr Lys Thr Pro Leu Pro Gly Trp Val Gly Val Leu Ala Gly Phe 
245 250 255 

TAC GGG AGG CTG ATG GCG GGC GTT GAC CTC GGC TTC GGC CCC GCT GGG 
Tyr Gly Arg Leu Met Ala Gly Val Asp Leu Gly Phe Gly Pro Ala Gly 
2*0 2*S 270 

GTG GAG CGC GTG GAT AAG CCG TTG CTG GTG GTG TAT GGG CCC CGG GAC 
Val Olu Arg Val Asp Lys Pro Leu Leu Val Val Tyr Gly Pro Arg Asp 
275 280 285 

CCG CTG GTG ACG CGG OAC GAG GCG AGO AGC CTG GCG TCC CGT AGC CCG 
Pro Leu Val Tor Arg Asp Glu Ala Arg Ser Leu Ala Ser Arg Ser Pro 
290 295 300 

TCT GGC CGT CTC GTC GAG GTT OCT GGG GCT GGC CAC GTG GAG GCC GTG 
Cys Gly Arg Leu Val Glu Val Pro Gly Ala Oly His Val Glu Ala Val 
305 310 315 320 

GAT GTG CTC GGG CCG GGC CGC TAC GCA GAC ATG CTG ATA GAG CTG GCG 
Asp Val Leu Gly Pro Gly Arg Tyr Ala Asp Met Leu He Glu Leu Ala 
325 330 335 

CAC GAG GAG TGC CCT CCG GGG GCC GOT GGC TGA 
His Glu Glu Cys Pro Pro Gly Ala Gly Gly 
340 345 


(2) INFORMATION FOR SEO ID NO:25: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 789 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 
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10 


15 


25 


30 


35 


<xi) SEQUENCE DESCRIPTION: SEQ ZD NO: 25; 

ATG CCA TAT GTT AGG AAT GGT GGT GTA AAT ATC TAT TAT GAA CTG GTC 
Met Pro Tyr Val Arg Asn Oly Cly Val Aan lie Tyr Tyr Glu Leu Val 
1 5 10 IS 


48 


GAT GGA CCT GAG CCA CCA ATT GTC TTT GTT CAC GGA TGG ACA GCA AAT 96 
Asp Gly Pro Glu Pro Pro lie Val Phe Val His Gly Trp Thr Ala Asn 
20 25 30 

ATG AAT TTT TGG AAA GAG GAA AGA CGT TAT TTT GCA GGC AGG AAT ATG 144 
Ket Asn Phe Trp Lys Glu Gin Arg Arg Tyr Phe Ala Gly Arg Asn Met 
35 40 45 

ATG TIG TTT GTC GAT AAC AGA GGT CAT GGC AGG TCC GAT AAG CCA CTT XS2 
Met lieu Phe Val Asp Asn Arg Gly His Gly Arg Ser Asp Lys Pro Leu 
50 55 €0 

GGA TAC GAT TTC TAC AGA TTT GAG AAC TTC ATT TCA GAT TTA GAT GCG 240 
Gly Tyr Asp Phe Tyr Arg Phe Glu Asn Phe lie Ser Asp Leu Asp Ala 
. 65 70 75 00 

GTT GTT AGG GAG ACT GGA GTG GAG AAA TTT GTT CTC GTC GGA CAT TCA 288 
Val Val Arg Glu Thr Gly Val Glu Lys Phe Val Leu Val Gly His Ser 
20 85 90 95 

TTC GGA ACA ATG ATC TCT ATG AAG TAC TGT TOG GAG TAT CGG AAT CGG 336 
Phe Gly Thr Met lie Ser Met Lys Tyr Cys Ser Glu Tyr Arg Aan Arg 
100 105 110 


GTT CTT GCT CTA ATC CTC ATA GGT GGT GGG AGC AGA ATA AAG CTT CTA 384 
Val Leu Ala Leu He Leu He Gly Gly Gly Ser Arg He Lys Leu Leu 
115 120 125 

CAC AGA ATT GGA TAT CCT TTA GCA AAG ATT CTT GCA TCC ATT GCA TAC 432 
His Arg He Gly Tyr Pro Leu Ala Lys He Leu Ala Ser He Ala Tyr 
130 135 140 

AAG AAG TCT TCA AGA TTG GTC GCA GAT CTT TCC TTT GGC AAA AAT GCT 480 
Lys Lys Ser Ser Arg Leu Val Ala Asp Leu Ser Phe Gly Lys Asn Ala 
145 150 155 160 

GGT GAA CTT AAA GAG TGG GGA TGG AAA GAG GCA ATG GAT TAT ACA CCC 528 
Gly Glu Leu Lys Glu Trp Gly Trp Lys Gin Ala Met Asp Tyr Thr Pro 
165 170 175 

TCC TAC GTG GCA ATG TAC ACC TAC AGA ACT CTA ACG* AAA GTG AAT CTT 576 
Ser Tyr Val Ala Met Tyr Thr Tyr Arg Thr Leu Thr Lys Val Asn Leu 
180 185 190 

GAA AAT ATC TTG GAG AAA ATA GAC TGT CCA ACA CTG ATT ATC GTT GGA $24 

40 Glu Asn He Leu Glu Lys He Asp Cys Pro Thr Leu He He Val Gly 

195 200 205 

GAA GAG GAT GCA CTA TTG CCC GTT ACC AAA TCA GTT GAG CTG AGC AGG S72 
Glu Glu Asp Ala Leu Leu Pro Val Ser Lys Ser Val Glu Leu Ser Arg 
210 215 220 

45 AGG ATA GAA AAC TCA AAG CTT GTG ATC ATC CCA AAC TCG GGG CAT TGC 720 

Arg He Glu Asn Ser Lys Leu Val He He Pro Asn Ser Gly His Cys 
225 230 235 240 

GTA ATG CTT GAG ACT CCA AGT GAG GTT AAT AGA GCA ATG GAC GAA TTC 768 
Val Met Leu Glu Ser Pro Ser Glu Val Asn Arg Ala Met Asp Glu Phe 
w 24S 250 255 


ATT TCT TCA GCA CAG TTC TAA 774 
55 He Ser Ser Ala Gin Phe 

260 


(2) INFORMATION FOR SEQ ID NO:26: 
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(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 756 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULAR TYPE: GENOMIC DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 


TTO AGA TTQ AGO AAA TXT GAA GAG ATA AAC CTC GTT CTT TCG GGA GQA 
Leu Arg Leu Arg Lys Phe Glu Glu He Aan Leu Val Leu Ser Gly QXy 
1 5 10 15 

GCT GCA AAG GGC ATA GCC CAC ATA GGT GTT TTQ AAA OCT ATA AAC GAG 
Ala Ala Lys Gly lie Ala His He Gly Val Leu Lys Ala He Asn Glu 
20 25 30 

CTC GGT ATA AGG GTG AGG GCT TtA AGC GGG GTO AGC GCC GGG GCA ATC 
Leu Gly He Arg Val Arg Ala Leu Ser Gly Val Ser Ala Gly Ala He 
35 40 45 

GTT TCG GTC TTT TAT GCC TCA GGC TAC TCC CCT GAA GGG ATO TTC AGC 
Val Ser Val Phe Tyr Ala Ser Gly Tyr Ser Pro Glu Gly Met Phe Ser 
50 55 60 

CTT CTG AAG AGG GTA AAC TGG CTG AAG CTG TTT AAG TTC AAG CCA CCT 
Leu Leu Lys Arg Val Asn Trp Leu Lys Leu Phe Lys Pbe Lye Pro Pro 
65 70 75 80 

CTG AAG GGA TTG ATA GGG TGG GAG AAG GCT ATA AGA TTC CTT GAG GAA 
Leu Lys Gly Leu He Gly Trp Glu Lys Ala He Arg Phe Leu Glu Glu . 

85 SO 95 

GTT CTC CCT TAC AGG AGA ATA GAA AAA CTT GAG ATA CCG ACQ TAT AXA 
val Leu Pro Tyr Arg Arg He Glu Lys Leu Glu He Pro Thr Tyr He 
100 105 110 

TGC GCG ACG GAT" TTA TAC TCG GGA AGG GCT CTA TAC CTC TCG GAA GGG 
Cys Ala Thr Asp Leu Tyr Ser Gly Arg Ala Leu Tyr Leu Ser Glu Gly 
115 120 125 

AGT TTA ATC CCC GCA CTT CTC GGC AGC TGT GCA ATT CCC GGC ATA TTT 
Ser Leu lie Pro Ala Leu Leu Gly Ser Cys Ala He Pro Gly He Phe 
130 135 140 

GAA CCC GTT GAG TAT AAG AAT TAC TTG CTC GTT GAC GGA GGT ATA GTT 
Glu Pro Val Glu Tyr Lys Asn Tyr Leu I*eu Val Asp Gly Gly He Val 
14S 150 155 160 

AAC AAC CTT CCC GTT GAG CCC TTT CAG GAA AGC GGT ATT CCC ACC GTT 
Asn Asn Leu Pro Val Glu Pro Phe Gin Glu Ser Gly lie Pro Thr Val 
165 170 175 

TGC GTT GAT GTC CTT CCC ATA GAG CCG GAA AAG GAT ATA AAG AAC ATT 
Cya Val Asp Val Leu Pro He Glu Pro Glu Lys Asp He Lys Asn He 
180 165 190 
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err cac atc ctt ttg agg agc ttc ttt ctt gcg gtc cgc tca aac tec 

Leu His He lieu Leu Arg Ser Phe Phe Leu Ala Val Arg Ser Aan Ser 
195 200 205 

GAA AA6 AGA AAO GAG TTT TGT GAC CTC GTT ATA GTT CCT GAG CTT GAG 
Glu Lyre Arg Lys Glu Phe Cys Asp Leu Val lie Val Pro Glu Leu Glu 
210 215 220 

GAG TTC ACA CCC CTT GAT GTT AGA AAA GCG GAC GAA ATA ATG GAG AGG 
Glu Phe Thr Pro Leu Asp Val Arg Lys Ala Asp Gin tie Met Glu Arg 
225 230 235 240 

GGA TAC ATA AAG GCC TTA GAG GTA CTT TCT GAA TAG 
Gly Tyr He Lys Ala Leu Glu Val Leu Ser Glu 
245 250 


(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 894 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULAR TYPE: GENOMIC DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 


ATG TTT AAT ATC AAT GTC TTT GTT AAT ATA TCT TGG CTG TAT TTT TCA 
Met Phe Asn He Asn Val Phe Val Asn He Ser Trp Leu Tyr Phe Ser 
x 5 10 15 

GGG ATA GTT ATG AAG ACT GTG GAA GAG TAT GCG CTA CTT GAA ACA GGC 
Gly He Val Met Lys Thr Val Glu Glu Tyr Ala Leu Leu Glu Thr Gly 
20 25 30 

GTA AGA GTG TTT TAT CGG TGT GTA ATC CCG GAG AAA GCT TTT AAC ACT 
Val Arg Val Phe Tyr Arg Cys Val He Pro Glu Lya Ala Phe Aan Thr 
35 40 45 

TTG ATA ATA GGT TCA CAC GGA TTG GGG GCG CAC AGT GGA ATC TAC ATT 
Leu He He Gly Ser His Gly Leu Gly Ala His Ser Gly He Tyr He 
50 55 60 

AGT GTT GCT GAA GAA TTT GCT AGO CAC GGA TTT GGA TTC TGC ATG CAC 
Ser Val Ala Glu Glu Phe Ala Arg His Gly Phe Gly Phe Cys Met His 
65 70 75 80 

GAT GAA AGG GGA CAT GGG AGA ACG GCA AGC GAT AGA GAA AGA GGG TAT 
Asp Gin Arg Gly His Gly Arg Thr Ala Ser Asp Arg Glu Arg Gly Tyr 
85 90 95 

GTG GAG GGC TTT CAC AAC TTC ATA GAG GAT ATG AAG GCC TTC TCC GAT 
Val Glu Gly Phe His Asn Phe He Glu Asp Met Lys Ala Phe Ser Asp 
100 105 110 

TAT GCC AAG TGG CGC GTG GGA GGT GAC GAA ATA ATA TTG CTA GGA CAC 
Tyr Ala Lys Trp Arg Val Gly Gly Asp Glu He He Leu Leu Gly His 
115 120 125 

AGT ATG GGC GGG CTG ATA GCG CTC TTA ACA GTT GCA ACT TAT AAA GAA 
Ser Met Gly Gly Leu lie Ala Leu Leu Thr Val Ala Thr Tyr Lys Glu 
130 135 140 

ATC GCC AAG GGA GTT ATC GCG CTA GCC CCG GCC CTC CAA ATC CCC TTA 
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He Ala Lye Gly val lie Ala Leu Ala Pro Ala Leu Gin He Pro Leu 
145 150 155 160 

ACC COS OCT ASA AGA CTT GTT CTA AGO CTC GCG TCA AGG CTT GCC CCG 
Thr Pro Ala Arg Arg Leu Val Leu Ser Leu Ala Ser Arg Leu Ala Pro 
165 170 175 

CAT TCT AAG ATC ACC TEA CAA AGG AGA TIG CCG CAG AAA CCA GAG GGT 
His Ser Irys He Thr Leu Gin Arg Arg Leu Pro Gin Lys Pro Glu Gly 
180 185' 150 

TTT CAA AGA GCA AAA GAT ATA GAA TAC AGT CTG AGT GAA ATA TCA GTC 
Phe Gin Arg Ala Lys Asp He Glu Tyr Ser Leu Ser Glu He Ser Val 
195 200 205 

AAG CTC GTG GAC GAA ATG ATT AAA GCA TCA TCT ATC TTC TGG ACC ATA 
Lys Leu Val Asp Glu Met He Lys Ala Ser Ser Net Phe Trp Thr He 
210 215 220 

GCA GOG GAA ATT AAT ACT CCC GTC CTG CTT ATT CAT GGG GAA AAA GAC 
Ala Gly Glu He Asn Thr Pro Val Leu Leu He His Gly Glu Lys Asp 
225 230 23S 240 

AAT GTC ATA CCT CCG GAG GCG AGC AAA AAA GCC TAC CAA TTA ATA CCT 
Asn Val He Pro Pro Glu Ala Ser Lys Lys Ala Tyr Gin Leu He Pro 
245 250 255 

TCA TTC CCT AAA GAG TTG AAA ATA TAC CCC GAT CTT GGA CAC AAC TTQ 
Ser Phe Pro Lys Glu Leu Lys He Tyr Pro Asp Leu Gly His Asn Leu 
260 2(5 270 

TTT TTT GAA CCA GGC GCG GTG AAA ATC GTC ACA GAC ATT GTA GAG TGG 
Phe Phe Glu Pro Gly Ala Val Lys He Val Thr Asp He Val Glu Trp 
275 280 285 

GTT AAG AAT CTA CCC AGG GAA AAT CCT TAA 
Val Lys Asn Leu Pro Arg Glu Asn Pro 
290 295 


(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 789 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 


ATG GAG GTT TAC AAG GCC AAA TTC GGC GAA GCA AAG CTC GGC TGG GTC 
Met Glu Val Tyr Lys Ala Lys Phe Gly Glu Ala Lys Leu Gly Trp Val 
15 10 15 

GTT CTG GTT CAT GGC CTC GGC GAG CAC AGC GGA AGG TAT GGA AGA CTG 
val Leu Val Bis Gly Leu Gly Glu His ser Gly Arg Tyr Gly Arg Leu 
20 25 30 

ATT AAG GAA CTC AAC TAT GCC GGC TTT GGA GTT TAC ACC TTC GAC TGG 
lie Lys Glu Leu Asn Tyr Ala Gly Phe Gly Val Tyr Thr Phe Asp Trp 
35 40 45 

CCC GGC CAC GGG AAG AGC CCG GGC AAG AGA GGG CAC ACG AGC GTC GAG 
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Pro Gly Hie Gly Lye Ser Pro Gly Lye Arg Gly His Thr Ser Val Glu 
SO 55 so 

GAG GCG ATG GAA ATC ATC GAC TCG ATA ATC GAG GAG ATC AGO GAG AAG 
Glu Ala Met Glu lie lie Asp Ser lie lie Glu Glu He Arg Glu Lys 
65 *?0 75 80 

CCC TTC CTC TTC GGC CAC AGC CTC GGT GGT CTA ACT GTC ATC AGG TAC 
Pro Phe Leu Phe Gly His Ser Leu Gly Gly Leu Thr Val He Arg Tyr 
85 90 95 

GCT GAG ACG CGG CCC GAT AAA ATA CGG GGA TTA ATA GCT TCC TCG OCT 
Ala Glu Thr Arg Pro Asp Lys He Arg Gly Leu Zle Ala Ser Ser Pro 
100 105 no 

GCC CTC GCC AAG AGC CCG GAA ACG CCG GGC TTC ATG GTG GCC CTC GCG 
Ala Leu Ala Lys Ser Pro Glu Thr Pro Gly Phe Met Val Ala Leu Ala 
115 120 125 

AAG TTC CTT GGA AAG ATC GCC CCG GGA GTT GIT CTC TCC AAC GGC ATA 
Lys Phe Leu GXy Lys He Ala Pro Gly Val Val Leu Ser Asa Gly lie 
130 135 140 

AAG CCG GAA CTC CTC TCG AGG AAC AGG GAC GCC GTG AGG AGG TAC GTT 
Lys Pro Glu Leu Leu Ser Arg Asn Arg Asp Ala Val Arg Arg Tyr Val 
145 150 155 160 

GAA GAC CCA CTC GTC CAC GAC AGG ATT TCG GCC AAG CTG GGA AGG AGC 
Glu Asp Pro Leu Val His Asp Arg He Ser Ala Lys Leu Gly Arg Ser 
165 170 17S 

ATC TTC GTG AAC ATG GAG CTG GCC CAC AGG GAG GCG GAC AAG ATA AAA 
He Piie Val Asn Met Glu Leu Ala His Arg Glu Ala Asp Lys He Lys 
180 185 190 

GTC CCG ATC CTC CTT CTG ATC GGC ACT GGC GAT GTA ATA ACC CCG CCT 
Val Pro He Leu Leu Leu He Gly Thr Gly Asp Val He Thr Pro Pro 
195 200 205 

GAA GGC TCA CGC AGA CTC TTC GAG GAG CTG GCC GTC GAG AAC AAA ACC 
Glu Gly Ser Arg Arg Leu phe Glu Glu Leu Ala Val Glu Asn Lys Thr 
210 215 220 

CTG AGG GAG TTC GAG GGG GCG TAC CAC GAG ATA ITT GAA GAC CCC GAG 
Leu Arg Glu Phe Glu Gly Ala Tyr His Glu He Phe Glu Asp Pro Glu 
225 230 235 240 

TGG GCC GAG GAG TTC CAC GAA ACA ATT GTT AAG TGG CTG GTT GAA AAA 
Trp Ala Glu Glu Phe His Glu Thr He Val Lys Trp Leu Val Glu Lys 
245 250 255 

TCG TAC TCT TCG GCT CAA TAA 
Ser Tyr Ser Ser Ala Gin 
260 


(2) INFORMATION FOR SEQ ID NO:29: 


(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 750 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
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TTO ATT GGC AAT TIG AAA .TTG AAO AGO TTT GAA GAG GTT AAC TTA GTT 
Leu lie Gly Asa Leu Lys JUeu Lys Arg Pbe Glu Glu Val Asn Leu Val 
1 S 10 15 

CTT TCG GGA GGG GCT GCC AAG GOT ATC GCC GAT ATA GGT GTT TTA AAA 
Leu Ser Gly Gly Ala Ala Lys Gly lie Ala Bis lie Gly Val Leu Lys 
20 25 30 

GCT CTG GAA GAG CTC GGT ATA AAG GTA AAG AGG CTC AGO GGG GTA AGT 
Ala Leu Glu Glu Leu Gly Xle Lys Val Lys Arg Leu ser Gly Val Ser 

35 40 45 

GCT GGA GCT ATC GTT TCC GTC TTT TAC GCT TOG GGC TAC ACT CCC GAC 
Ala Gly Ala He Val Ser Val Phe Tyr Ala Ser Gly Tyr Thr Pro Asp 
50 55 60 

GAG ATG TTA AAA CTC CTG AAA GAG GTA AAC TGG CTC AAA CTT TTT AAG 
Glu Met Leu Lys Leu Leu Lys Glu Val Asn Trp Leu Lys Leu Phe Lys 
65 70 75 80 

TTC AAA ACA CCG AAA ATG GGC TTA ATG GGG TGG GAG AAG GCT GGA GAG 
Plie Lys Thr Pro Lys Met Gly Leu Met Gly Trp Glu Lys Ala Ala Glu 
05 90 95 

TTT TIG GAA AAA GAG CTC GGA GTT AAG AGG CTG GAA GAC CTG AAC ATA 
Pbe Leu Glu Lys Glu Leu Gly Val Lys Arg Leu Glu Asp Leu Asn lie 
100 105 110 

CCA ACC TAT CTT TGC TCG GCG GAT CTG TAC ACG GGA AAG GCT CTT TAC 
Pro Thr Tyr Leu Cys Ser Ala Asp Leu Tyr Thr Gly Lys Ala Leu Tyr 
115 120 125 

TTC GGC AGA GGT GAC TTA ATT CCC GTG CTT CTC GGA AGT TGT TCC ATA 
Phe Gly Arg Gly Asp Leu lie Pro val Leu Leu Gly Ser Lys Ser lie 
130 135 140 

CCC GGG ATT TTT GAA CCA GTT GAG TAC GAG AAT TTT CTA CTT GTT GAC 
Pro Gly lie Phe Glu Pro Val Glu Tyr Glu Asn Phe Leu Leu Val Asp 
14S ISO 155 160 

GGA GGT ATA GTG AAC AAC CTG CCC GTA GAA CCT TTG GAA AAG TTC AAA 
Gly Gly lie Val Asn Asn Leu Pro Val Glu Pro Leu Glu Lys Phe Lys 
165 170 175 

GAA CCC ATA ATC GGG GTA GAT GTG CTT CCC ATA ACT GAA GAA AGA AAG 
Glu Pro lie lie Gly Val Asp Val Leu Pro He Thr Gin Glu Arg Lys 
180 185 190 

ATT AAA AAT ATA CTC CAC ATC CTT ATA AGG AGC TTC TTT CTG GCG GTT 
He Lye Asn Xle Leu His He Leu He Arg Ser Phe Phe Leu Ala Val 
195 200 205 

CGT TCC AAT TCG GAA AAG AGA AAG GAG TTC TGC AAC GTA GTT ATA GAA 
Arg Sex Asn Ser Glu Lys Arg Lys Glu Phe Cys Asn Val Val Xle Glu 
210 215 220 

CCT CCC CTT GAA GAG TTC TCT CCT CTG GAC GTA AAT AAG GCG GAC GAG 
Pro Pro Leu Glu Glu Phe Ser Pro Leu Asp Val Asn Lys Ala Asp Glu 
225 230 235 240 


ATA TTC TGC GGG GAT ATG AGA GGA CTT TAA 
He Phe Cys Gly Asp Met Arg Ala Leu 
245 


(2) INFORMATION FOR SEQ ID NO:30: 
(i) SEQUENCE CHARACTERISTICS 
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10 


(A) LENGTH: 1017 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 
(xi) SEQUENCE DESCRIPTION: SEQ In NO:30: 

ATG CCA GCT AAT GAC TCA CCC ACS ATC GAC TIT AAT CCT CGC GGC ATT 48 
Met Pro Ala Asn Asp Ser Pro Thr He Asp Phe Asn Pro Arg Gly lie 
IS 10 15 

* CIT CGC AAC GCT CAC GCA CAG GTT ATT TTA GCO ACT TCC GGC TTG CGC 96 
15 Leu Arg Asn Ala His Ala Gin Val He Leu Ala Thr Ser Gly Leu Arg 

20 25 30 

AAA GCG TTT TTG AAA CGC ACG CAC AAG AGC TAC CTC AGC ACT GCC CAA 144 
Lyo Ala Phe Leu Lys Arg Thr His Lys Ser Tyr Leu Ser Thr Ala Gin 
35 40 45 

20 TGG CTG GAG CTC GAT GCC GGC AAC GGA GTT ACC TTG GCC GGA GAG CTT 192 

Trp Leu Glu Leu Asp Ala Gly Asn Gly Val Thr Leu Ala Gly Glu Leu 
50 55 60 

AAC ACA GCG CCT GCA ACT GCA TCC TCC TCC CAC CCG GCG CAC AAG AAC 240 
Asn Thr Ala Pro Ala Thr Ala Ser Ser Ser His Pro Ala His Lyo Asn 
25 65 70 75 80 

ACT CTG GTT ATT GTG CTG CAC GGC TGG GAA GGC TCC AGC CAG TCG GCC 288 
Thr Leu Val He Val Leu His Gly Trp Glu Gly Ser Ser Gin Ser Ala 
85 90 95 

TAT GCG ACC TCC GCT GGC AGC ACG CTT TTC GAC AAT GGG TTC GAC ACT 336 
Tyr Ala Thr Ser Ala Gly Ser Thr Leu Phe Asp Asn Gly Phe Asp Thr 
100 105 110 

TTT CGC CTT AAT TTT CGC GAT CAC GGC GAC ACC TAC CAC TTA AAC CGC 384 
Phe Arg Leu Asn Phe Arg Asp His Gly Asp Thr Tyr His Leu Asn Arg 
115 120 125 

35 

GGC ATA TTT AAC TCA TCG CTG ATT GAC GAA GTA GTG GGC GCA GTC AAA 432 
Gly He Phe Asn Ser Ser Leu He Asp Glu Val Val Gly Ala Val Lys 
130 135 140 

GCC ATC CAG CAG CAA ACC GAC TAC GAC AAG TAT TGC CTG ATG GGG TTC 480 
A* Ala He Gin Gin Gin Thr Asp Tyr Asp Lys Tyr Cys Leu Ket Gly Phe 

145 ISO 155 160 

TCA CTG GGT GGG AAC TTT GCC TTG CGC GTC GCG GTG CGG GAA CAG CAT 528 
Ser Leu Gly Gly Asn Phe Ala Leu Arg Val Ala Val Arg Glu Gin His 
165 170 175 

45 CTC GCT AAA CCG CTA GCG GGC GTG CTC GCC GTA TGC CCG GTA CTC GAC 576 

Leu Ala Lys Pro Leu Ala Gly Val Leu Ala Val Cys Pro Val Leu Asp 
180 185 190 

CCC GCA CAC ACC ATG ATG GCC CTA AAC CG» GGT GCG TTT TTC TAC GGC 624 

50 


55 
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Pro Ala His Thr Met Net Ala lieu Asa Arg 01/ Ala Phe Phe Tyr Gly 

195 200 205 

CGC TAT TTT GCG CAT AAA TCG AAG CGC TCG TEA ACC GCA AAA CTT GCA 
Arg Tyr Phe Ala His Lye Trp Lys Arg Ser Leu Thr Ala Lys Leu Ala 
210 215 220 225 

OCT TTC CCA GAC TAC AAA TAC GGC AAA GAT TTA AAA TOO ATA CAC ACQ 
Ala Phe Pro Asp Tyr Lys Tyr Oly Lys Asp Leu Lys Ser lie Bis Thr 
230 235 240 

CTT GAT OAO TTA AAC AAC TAT TTC ATT CCC CGC TAC ACC GGC TTC AAC 
Leu Asp Glu Leu Aon Asn Tyr Phe He Pro Arg Tyr Thr Oly Phe Asn 
245 250 255 

TCA GTC TCC GAA TAC TTC AAA AGT TAC ACQ CTC ACC GGG GAG AAG CTC 
Ser Val ser Glu Tyr Phe Lys Ser Tyr Thr Leu Thr Gly Gin Lys Leu 
260 265 270 

GCG TTT CTC AAC TGC CCC AGT TAC ATT CTG GCA GCT GGC GAC GAC CCA 
Ala Phe Leu Asn Cys Pro Ser Tyr lie Leu Ala Ala Gly Asp Asp Pro 
275 200 285 

ATA ATT CCA GCA TCC GAC TTT CAG AAA ATA GCC AAG CCT GCG AAT CTG 
lie lie Pro Ala Ser Asp Phe Gin Lys lie Ala Lys Pro Ala Asn Leu 
290 295 300 305 

CAC ATA ACA GTA ACG CAA CAA GGT TCT CAT TGC GCA TAC CTG GAA AAC 
His lie Thr Val Thr Gin Gin Gly Ser His Cys Ala Tyr Leu Glu Asn 
310 315 320 

CTG CAT AAA CCT AGT GCT GCC GAC AAA TAT GCG GTG AAA TTA TTT GGA 
Leu His Lys Pro Ser Ala Ala Asp Lys Tyr Ala Val Lys Leu Phe Gly 
325 330 335 

GCC TGT TGA 
Ala Cys 


(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 936 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 
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ATG CTT GAT ATG CCA ATC GAC CCT GTT TAC TAC CA6 CTT GCT GAG TAT 48 
Met Leu Asp Met Pro lie Asp Pro Val Tyr Tyr Oln I*eu Ala Glu Tyr 
1 5 10 IS 

5 TTC GAC ACT CTG CCG AAG TTC GAC CAO TTT TCC TCG GCC AGA GAG TAC 3€ 

Pbe Asp Ser Leu Pro I-ys Phe Asp Gin Phe Ser Ser Ala Arg Glu Tyr 

20 25 30 

AGG GAG GCG ATA AAT CGA ATA TAC GAG GAG AGA AAC CGG CAG CTG AGC 144 
Arg Glu Ala lie Asa Arg lie Tyr Glu Glu Arg Asa Arg Gin Leu Ser 

10 35 40 45 

CAG CAT GAG AGG GTT GAA AGA GTT GAG GAC AGG ACG ATT AAG GGG AGG 192 
Gin His Glu Arg Val Glu Arg Val Glu Asp Arg Thr lie Lya Gly Arg 
50 55 60 


15 


20 


25 


30 


35 


40 


45 


50 


55 
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AAC GGA GAC ATC AGA GTC A6A GTT TAC CAG CAG AAG CCC GAT TCC CCG 
Asn Gly Asp lie Arg Val Arg Val Tyr Gin Gin Lys Pro Asp Ser Pro 
65 70 75 80 

GGT CTG GTT TAC TAT CAC GGT GGT GGA TTT GTG ATT TGC AGC ATC GAG 
Val Leu val Tyr Tyr His Gly Gly Gly Phe val lie Cya Ser lie Glu 
85 90 95 

TOG CAC GAC GCC TTA TCC AGG AGA AYY GOG AGA CTT TCA AAC TCT ACC 
Ser His Asp Ala Leu Cys Arg Arg He Ala Arg Leu Ser Asn Ser Thr 
100 105 X10 

GTA GTC TCC GTG GAT TAC AGG CTC GCT OCT GAG CAC AAG TTT CCC CCC 
Val Val Ser Val Asp Tyr Arg Leu Ala Pro Glu His Lys Phe Pro Ala 

1X5 120 125 

CCA GTT TAT CAT TGC TAC GAT GCG ACC AAG TGG GTT GCT GAG AAC CGG 
Ala Val Tyr Asp Cys Tyr Asp Ala Thr Lys Trp Val Ala Glu Asn Ala 
130 135 140 

GAG GAG CTG AGG ATT GAC CCG TCA AAA ATC TTC GTT GGG GGG GAC ACT 
Glu Glu Leu Arg lie Asp Pro Ser Lys lie Phe Val Gly Gly Asp Ser 
145 150 155 160 

GOG GGA CGG AAT CTT GCC CCG GCG CTT TCA ATA ATG GCG AGA GAC AGC 
Ala Gly Gly Asn Leu Ala Ala Ala val Ser lie Met Ala Arg Asp Ser 
165 170 175 

GGA GAA GAT TTC ATA AAG CAT CAA ATT CTA ACT TAC CCC GTT GTG AAC 
Gly Glu Asp Phe lie Lys His Gin He Leu He Tyr Pro Val Val Asn 
180 185 190 

TTT GTA GCC CCC ACA CCA TOG CTT CTG GAG TTT GGA GAG GGG CIO TGG 
Phe Val Ala Pro Thr Pro Ser Leu Leu Glu Phe Gly Glu Gly Leu Trp 
195 200 205 

ATT CTC GAC CAG AAG ATA ATG ACT TGG TTC TCG GAG CAG TAC TTC TCC 
lie Leu Asp Gin Lys Zle Met Ser Trp Phe Ser Glu Gin Tyr Phe Ser 
210 215 230 

AGA GAG GAA GAT AAG TTC AAG CCC CTC GCC TCC GTA ATC TTT GCG GAC 
Arg Glu Glu Aso Lys Phe Asn Pro Leu Ala Ser Val lie Phe Ala Asp 

235 240 245 250 

CTT GAG AAC CTA CCT CCT GCG CTG ATC ATA ACC GCC GAA TAC GAC CCG 
Leu Glu Asn Leu Pro Pro Ala Leu He He Thr Ala Glu Tyr Asp Pro 
255 260 265 

CTG AGA GAT GAA GGA GAA GTT TTC GGG CAG ATG CTG AGA AGA GCC GGT 
Leu Arg Asp Glu Gly Glu Val Phe Gly Gin Met Leu Arg Arg Ala Gly 
270 375 280 

GTT GAG GCG AGC ATC GTC AGA TAC AGA GGC GTG CTT CAC GGA TTC ATC 
Val Glu Ala Ser Zle Val Arg Tyr Arg Gly Val Leu His Gly Phe He 
285 290 295 

AAT TAC TAT CCC GTG CTG AAG GCT GCG AGG GAT GCG ATA AAC CAG ATT 
Asn Tyr- Tyr Pro Val Leu Lys Ala Ala Arg Asp Ala Zle Asn Gin Zle 
300 305 310 

GCC GCT CTT CTT GTG TTC GAC TAG 
Ala Ala Leu Leu Val Phe Asp 
315 320 


(2) INFORMATION FOR SEQ ID NO:32: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 918 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 


35 


10 


15 


25 


30 


40 


45 
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(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

ATG CCC CIA GAT CCT AG* ATT AAA AAO TTA CTA 6AA TCA GCT CTT ACT 49 
Met Pro Leu Asp Pro Arg lie Lya Lys Leu Leu Glu Ser Ala Leu Thr 
5 10 is 


ATA CCA ATT GCT AAA GCC CCA CTA GAA GAG CTA AGA AAG ATA TTT AGG 
lie Pro lie Gly Vya Ala Pro Val Glu Glu Val Arg Lys lie Phe Aro 
20 25 30 


96 


CAA TTA GOG TOG OCA GCT CCC AAA GTC &tt GTT GGA AAA GTA GAA GAT 144 
Gin Leu Ala Ser Ala Ala Pro Lya Val Glu Val Gly Lys Val Glu Asp 
35 40 45 

ATA AAA ATA CCA GGC ACT GAA ACC GTT ATA AAC GCT AGA GTG TAT TTT 192 
lie Lya lie Pro Gly Ser Glu Thr Val lie Asn Ala Arg Val Tyr Phe 

20 50 S5 60 

CCG AAG ACT AGO GOT CCT TAT GOV GTT CTA GTG TAT CTT CAT GGA GGC 240 
Pro Lys Ser Ser Gly Pro Tyr Gly Val Leu Val Tyr Leu Bis Gly Gly 
65 70 75 80 


GCT TTT GTA ATA GGC GAT GTG GAA TCT TAT GAC OCA TTA TCT AGA GCA 288 
Gly Phe Val He Gly Asp Val Glu Ser Tyx Asp Pro Leu Cys Arg Ala 
85 90 95 

ATT ACA AAT GOG TGC AAT TGC GTT GTA CTA TCA GTG GAC TAT AGG TTA 336 
He Thr Asn Ala Cys Asn Cys Val Val val Ser Val Asp Tyr Arg Leu 
100 105 110 

GCT CCA GAA TAC AAG TTT CCT TCT GCA GTT ATC GAT TCA TTT GAC GCT 384 
Ala Pro Glu Tyr Lys Phe Pro Ser Ala Val He Asp Ser Phe Asp Ala 
115 120 125 

ACT AAT TGG GTT TAT AAC AAT TTA GAT AAA TTT GAT GGA AAG ATO GGA 432 
Thr Asn Trp val Tyr Asn Asn Leu Asp Lys Phe Asp Gly Lys Met Gly 
35 130 135 140 

GTT GOG ATT GCG GGA GAT ACT GCT GGA GGA AAT TTG GCA GCG GTT GTA 480 
Val Ala tie Ala Gly Asp Ser Ale Gly Gly Asn Leu Ala Ala Val Val 
145 150 155 160 


GCT CTT CTT TCA AAG GCT AAA ATT AAT TTG AAG TAT CAA ATA CTG GTT 528 
Ala Leu Leu Ser Lys Gly Lys He Asn Leu Lys Tyr Gin He Leu Val 
165 170 175 

TAC CCA GCG GTA ACT TTA GAT AAC GTT TCA AGA TCC ATG ATA GAG TAC 57$ 
Tyr Pro Ala Val Ser Leu Asp Asn Val Ser Arg Ser Met He Glu Tyr 
180 IBS 190 

TCT GAT GGG TTC TTC CTT ACC AGA GAG CAT ATA GAG TGG TTC GCT TCT 524 
Ser Asp Gly Pne Phe Leu Thr Arg Glu His He Glu Trp Phe Gly Ser 
135 200 205 


so 


55 


36 


EP 0 880 590 B1 


CAA TAC TTA CGA AGC CCT GCA GAT TIG CTA GAC TTT AGG TTC TCT CCA 
Gin Tyr lieu Arg Ser Pro Ala Asp Leu Leu Asp Phe Arg Phe Ser Pro 
210 215 220 

ATT CTO GCG CAA GAT TTC AAC GGA TTA CCT CCA GCC TIG ATA ATA ACA 
He Leu Ala Gin Asp Phe Asn Gly Leu Pro Pro Ala Leu lie lie Thr 
225 230 235 240 

GCA GAA TAC GAT CCA CTA AGG GAT CAA GGA GAA GCG TAT GCA AAT AAA 
Ala Glu Tyr Asp Pro Leu Arg Asp Gin Gly Glu Ala Tyr Ala Asn Lys 
245 250 255 

CTA CTA CAA GCT GGA GTC TCA GTT ACT AGT GTG AGA TTT AAC AAC GTT 
Leu Leu Gin Ala Gly val Ser Val Thr Ser Val Arg Phe Asn Asn Val 

260 2SS 270 

ATA GAC GGA TTC CTC TCA TTC TTT CCO TIG ATG GAG CAA GGA AGA GAT 
He Bis Gly Phe Leu Ser Phe Phe Pro Leu Met Glu Gin Gly Arg Asp 
275 280 285 

GCT ATA GGT CTC ATA GGG TCT GTG TTA AGA GGA GZA TTT TAT GAT AAA 
Ala He Gly Leu He Gly Ser Val Leu Arg Arg Val Phe Tyr Asp Lys 
290 295 300 

ATT TAA 

He 

305 


(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 184 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
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Met Ser Leu Asn Lys His Ser Trp Met Asp Met He lie Phe He Leu 
15 10 15 

Ser Phe Sex Phe Pro Leu Thr Met He Ala Leu Ala He Ser Met Ser 

5 20 25 .30 

Ser Trp Phe Asn He Trp Asn Asn Ala Leu Ser Asp Leu Gly His Ala 
35 40 45 

Val Lys Ser Ser Val Ala Pro He Phe Asn Leu Gly Leu Ala He Gly 

10 SO 55 60 

Gly He Leu He Val He Val Gly Leu Arg Asn Leu Tyr Ser Trp Ser 
€5 70 75 80 


15 


20 


25 


30 


35 


Arg Val Lys Gly Ser Leu He He Ser Met Gly Val Phe Leu Asn Leu 
85 90 95 

He Gly Val Phe Asp Glu Val Tyr Gly Trp He His Phe Leu Val Ser 
100 105 110 

Val Leu Phe Phe Leu Ser lie He Ala Tyr Phe He Ala He Ser He 
115 120 125 

Leu Asp Lys Ser Trp He Ala Val Leu Leu He He Gly His He Ala 
130 135 140 


Met Trp Tyr Leu His Phe Ala Ser. Glu He Pro Arg Gly Ala Ala He 
145 150 155 160 

Pro Glu Leu Leu Ala Val Phe Ser Phe Leu Pro Phe Tyr He Arg Asp 
1«5 170 175 

Tyr Phe Lys Ser Tyr Thr Lys Arg 

180 

(2) INFORMATION FOR SEQ ID NO:34: 
(i) SEQUENCE CHARACTERISTICS 


(A) LENGTH: 346 AMINO ACIDS 

(B) TYPE: AMINO ACID 
40 (D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

45 


50 


55 
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Met Lys Leu lieu Glu Pro Thr Asn Thr ser Tyx Thr Leu Leu Gin Asp 
15 10 is 

Leu Ala Leu His Phe Ala Phe Tyr Trp Phe Leu Ala Val TYr Thr Trp 
20 25 30 

Leu Pro Gly Val Leu val Arg Gly Val Ala Val Asp Thr Gly Val Ala 
35 40 45 

Arg Val Pro Gly Leu Gly Arg Arg Gly Lys Arg Leu Leu Leu Ala Ala 
50 55 60 

Val Ala Val Leu Ala Leu Val Val Ser Val Val Val Pro Ala Tyr Val 
65 70 75 80 

Ala Tyr Ser Ser Leu His Pro Glu Ser Cya Arg Pro Val Ala Pro Glu 
85 90 95 

Gly Leu Thr Tyr Lys Glu Phe Ser Val Thr Ala Glu Asp Gly Leu Val 
100 105 110 

Val Arg Gly Trp Cal Leu Gly Pro Gly Ala Gly Gly Asn Pro Val Phe 
115 120 125 

Val Leu Met His Gly Tyx Thr Gly Cya Arg Ser Ala Pro Tyr Met Ala 
130 135 140 

Val Leu Ala Arg Glu Leu Val Glu Trp Gly Tyr Pro Val Val Val Phe 
145 150 155 160 

Asp Phe Arg Gly His Gly Glu Ser Gly Gly Ser Thr Thr He Gly Pro 
165 170 175 

Arg Glu Val Leu Asp Ala Arg Ala Val Val Gly Tyr Val Ser Glu Arg 
180 185 190 

Phe Pro Gly Arg Arg He He Leu Val Gly Phe Ser Met Gly Gly Ala 
195 200 205 

Val Ala He Val Glu Gly Ala Gly Asp Pro Arg Val Tyr Ala Val Ala 
210 215 220 

Ala Asp Ser Pro Tyr Tyr Arg Leu Arg Asp Val He Pro Arg Trp Leu 
225 230 235 240 


Glu Tyr Lys Thr Pro Leu Pro Gly Trp Val Gly Val Leu Ala Gly Phe 
245 250 255 

Tyr Gly Arg Leu Met Ala Gly Val Asp Leu Gly Phe Gly Pro Ala Gly 
260 265 270 

Val Glu Arg Val Asp Lys Pro Leu Leu Val Val Tyr Gly Pro Arg Asp 
275 280 285 

Pro Leu Val Thr Arg Asp Glu Ala Arg Ser Leu Ala Ser Arg Ser Pro 
290 295 300 

Cys Gly Arg Leu Val Glu Val Pro Gly Ala Gly His Val Glu Ala Val 
305 310 315 320 

Asp Val Leu Gly Pro Gly Arg Tyr Ala Asp Met Leu He Glu Leu Ala 
325 330 335 

His Glu Glu Cys Pro Pro Gly Ala Gly Gly 
340 345 
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(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 262 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 


Met Pro Tyr Val Arg Asn Gly Gly Val Asn lie Tyr Tyr Glu Leu Val 
15 10 15 

Asp Gly Pro Glu Pro Pro lie Val Phe Val His Gly Trp Thr Ala Asn 
20 25 30 

Met Asn Phe Trp Lys Glu Gin Arg Arg Tyr Phe Ala Gly Arg Asn Met 

35 40 45 

Met Leu Phe Val Asp Asn Arg Gly His Gly Arg Ser Asp Lys Pro Leu 
50 55 60 

Gly Tyr Asp Phe Tyr Arg Phe Glu Ash Phe lie Ser Asp Leu* Asp Ala 
65 70 75 80 

val Val Arg Glu Thr Gly val Glu Lys Phe val Leu val Gly His Ser 
85 90 95 

Phe Gly Thr Met lie Ser Met Lys Tyr Cys Ser Glu Tyr Arg Asn Arg 
100 105 110 

Val Leu Ala Leu lie Leu lie Gly Gly Gly Ser Arg He Lys Leu Leu 
115 120 125 

His Arg lie Gly Tyr Pro Leu Ala Lys lie Leu Ala Ser lie Ala Tyr 
130 13S 140 

Lys Lys Ser Ser Arg Leu Val Ala Asp Leu Ser Phe Gly Lys Asn Ala 
145 150 155 160 

Gly Glu Leu Lys Glu Trp Gly Trp Lys Gin Ala Met Asp Tyr Thr Pro 

165 170 X7S 


Ser Tyr Val Ala Met Tyr Thr Tyr Arg Thr Leu Thr Lys Val Asn Leu 

100 185 ISO 

Glu Asn He Leu Glu Lys He Asp Cys Pro Thr Le\i He He Val Gly 
195 200 205 

Glu Glu Asp Ala Leu Leu Pro Val Ser Lys Ser Val Glu Leu Ser Arg 
210 215 ' 220 

Arg He Glu Asn Ser Lys Leu Val He He Pro Asn Ser Gly His Cys 
225 230 235 240 

Val Met Leu Glu Ser Pro Ser Glu Val Asn Arg Ala Met Asp Glu Phe 
245 250 255 

He Ser Ser Ala Gin Phe 
260 
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(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 251 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 


Leu Arg Leu Arg Lys Phe Glu Glu He Asn Leu Val Leu Ser Gly Gly 
1 5 10 15 

Ala Ala Lys Gly lie Ala His He Gly Val Leu Lys Ala lie Asn Glu 
20 25 30 

Leu Gly He Arg val Arg Ala Leu Ser Gly Val Ser Ala Gly Ala He 
35 40 45 

Val Ser Val Phe Tyr Ala Ser Gly Tyr Ser Pro Glu Gly Met Phe Ser 
50 55 eo 

Leu Leu Lys Arg Val Aen Trp Leu Lys Leu Phe Lys phe Lye Pro Pro 
65 70 75 80 

Leu Ly3 Gly Leu lie Gly Trp Glu Lys Ala lie Arg Phe Leu Glu Glu 
85 90 55 

Val Leu Pro Tyr Arg Arg He Glu Lys Leu Glu He Pro Thr Tyr He 
100 105 no 

Cys Ala Thr Asp Leu Tyr Ser Gly Arg Ala Leu Tyr Leu Ser Glu Gly 
ilS 120 125 

Ser Leu He Pro Ala Leu Leu Gly Ser Cys Ala He Pro Gly He Phe 
130 135 140 

Glu Pro Val Glu Tyr Lys Asn Tyr Leu Leu Val Asp Gly Gly He Val 
1^5 150 155 1€0 

Asn Asn Leu Pro Val Glu Pro Phe Gin Glu Ser Gly He Pro Thr Val 
1^5 170 175 

Cys Val Asp val Leu Pro He Glu Pro Glu Lys Asp He Lys Asn He 
ISO 185 190 


Leu His lie Leu Leu Arg Ser Phe Phe Leu Ala Val Arg Ser Asn Ser 
195 200 205 

Glu Lys Arg Lys Glu Phe Cys Asp Leu Val He Val Pro Glu Leu Glu 
21° 215 220 

Glu Phe Thr Pro Leu Asp Val Arg Lys Ala Asp Gin He Met Glu Arg 
225 230 235 240 

Gly Tyr He Lys Ala Leu Glu Val Leu Ser Glu 
2*5 250 


(2) INFORMATION FOR SEQ ID NO:37: 
(i) SEQUENCE CHARACTERISTICS 
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(A) LENGTH: 297 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 


Met Phe Asa lie Asn Val Phe Val Asa lie Ser Trp Leu Tyr Phe Ser 
1 5 10 IS 

Gly lie Val Met Lys Tor Val Glu Glu Tyr Ala Leu Leu Olu Tar Gly 
20 25 30 

Val Arg Val Phe Tyr Arg Cys Val He Pro Glu Lys Ala Phe Asn Thr 
35 40 45 

Leu He He Gly Ser His Gly Leu Gly Ala His Ser Gly He Tyr He 
50 55 $0 

Ser Val Ala Glu Glu Phe Ala Arg His Gly Phe Gly Phe Cys Met His 
65 70 75 60 

Asp Gin Arg Gly His Gly Arg Tor Ala Ser Asp Arg Glu Arg Gly Tyr 
85 90 95 

Val Glu Gly Phe His Asn Phe He Glu Asp Met Lys Ala Phe Ser Asp 
100 105 110 

Tyr Ala Lys Trp Arg Val Gly Gly Asp Glu lie He Leu Leu Gly His 
115 120 125 

Ser Met Gly Gly Leu He Ala Leu Leu Thr Val Ala Thr Tyr Lys Glu 
130 135 140 

He Ala Lys Gly Val He Ala Leu Ala Pro Ala Leu Gin He Pro Leu 
145 150 155 160 

Thr Pro Ala Arg Arg I^eu Val Leu Ser Leu Ala Ser Arg Leu Ala Pro 
165 170 175 

His Ser Lys He Thr Leu Gin Arg Arg Leu Pro Gin Lys Pro Glu Gly 
ISO 185 190 

Phe Gin Arg Ala Lys Asp He Glu Tyr Ser Leu Ser Glu He Ser Val 
195 200 205 

Lys Leu Val Asp Glu Met He Lys Ala Ser Ser Met Phe Trp Thr He 
210 215 220 


Ala Gly Glu He Asn Thr Pro Val Leu Leu He His Gly Glu Lys Asp 
225 230 235 240 

Asn Val He Pro Pro Glu Ala Ser Lys Lys Ala Tyr Gin Leu He Pro 
245 250 255 

Ser Pn* Pro Lys Glu Leu Lys He Tyr Pro Asp Leu Gly His o Asn Leu 
260 265 270 

Phe Phe Glu Pro Gly Ala Val Lys He Val Thr Asp He Val Glu Trp 

275 2S0 285 

Val Lys Asn Leu Pro Arg Glu Asn Pro 
290 295 


42 


EP 0 880 590 B1 


10 


20 


25 


30 


35 


40 


45 


50 


55 


(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 262 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 


Met Glu Val Tyr l*ys Ala Lys Phe Gly Glu Ala Lys Leu Gly Trp Val 
15 1 S 10 15 

Val Leu Val His Gly Leu Gly Glu His Ser Gly Arg Tyr Gly Arg Leu 
20 25 30 


Zle Lys Glu Leu Asn Tyr Ala Gly Phe Gly Val Tyr Thr Phe Asp Trp 
35 40 45 

Pro Gly His Gly Lys Ser Pro Gly Lys Arg Gly Bis Thr Ser Val Glu 
50 55 60 

Glu Ala Met Glu He lie Asp Ser He Zle Glu Glu He Arg Glu Lys 
65 70 75 80 

Pro Phe Leu Phe Gly Kid Ser Leu Gly Gly Leu Thr Val lie Arg Tyr 
85 90 ' 95 

Ala Glu- Thr Arg ?ro Asp Lys He Arg Gly. Leu lie Ala Ser Ser Pro 
100 105 110 

Ala Leu Ala Lys Ser Pro Glu Thr Pro Gly Phe Met Val Ala Leu Ala 
115 120 125 

Lys Phe Leu Gly Lys He Ala Pro Gly Val Val Leu Ser Asn Gly He . 
130 135 140 

Lys Pro Glu Leu Leu Ser Arg Asn Arg Asp Ala Val Arg Arg Tyr Val 
145 150 155 160 

Glu Asp Pro Leu Val His Asp Arg He Ser Ala Lys Leu Gly Arg Ser 
165 170 17S 

He Phe Val Asn Met Glu Leu Ala His Arg Glu Ala Asp Lys He Lys 
ISO 195 190 

Val Pro He Leu Leu Leu He Gly Thr Gly Asp Val He Thr Pro Pro 
195 200 205 


Glu Gly Ser Arg Arg Leu Phe Glu Glu Leu Ala Val Glu Asn Lys Thr 

210 215 220 

Leu Arg Glu Phe Glu Gly Ala Tyr His Glu lie Phe Glu Asp Pro Glu 
225 230 23S 240 

Trp Ala Glu Glu Phe Hi 9 Glu Thr He Val Lys Trp Leu Val Glu Lys 
245 250 255 

Ser Tyr Ser Ser Ala Gin 
260 


(2) INFORMATION FOR SEQ ID NO:39: 
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(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 249 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 


Leu He Gly Asn Leu Lys Leu Lys Arg Phe Glu Glu Val Acn Leu Val 
15 id 15 

Leu Ser Gly Gly Ala Ala Lys Gly He Ala aid He Gly Val Leu Lys 
20 25 30 

Ala Leu Glu Glu Leu Gly He Lys Val Lys Arg Leu Ser Gly Val Ser 
35 40 45 

Ala Gly Ala He Val Ser Val Phe Tyr Ala Ser Gly Tyr Thr Pro Asp 
50 ss co 

Glu Met Leu Lys Leu Leu Lys Glu Val Asn Trp Leu Lys Leu Phe Lys 
-*5 70 75 80 

Phe Lys Thr Pro Lys Met Gly Leu Met Gly Trp Glu Lys Ala Ala Glu* 
B5 90 95 

Phe Leu Glu Lys Glu Leu Gly Val Lye Arg Leu Glu Asp Leu Asn He 
100 105 110 

Pro Thr 'Tyr Leu Cys Ser Ala Asp Leu Tyr Thr Gly Lys Ala Leu Tyr 
115 120 125 

Phe Gly Arg Gly Asp Leu He Pro Val Leu Leu Gly Ser Lys Ser He 

130 135 140 

Pro Gly He Phe Glu Pro Val Glu Tyr Glu Asn Phe Leu Leu Val Asp 
145 150 155 1G0 

Gly Gly He Val Asn Asn Leu Pro Val Glu Pro Leu Glu Lys Phe Lys 
165 170 175 

Glu Pro He He Gly Val Asp Val Leu Pro He Thr Gin Glu Arg Lys 
ISO 185 190 

He Lye Asn He Leu His He Leu He Arg Ser Phe Phe Leu Ala Val 
195 200 205 

Arg Ser Asn Ser Glu Lys Arg Lys Glu Phe Cys Asn Val Val He Glu 

210 215 220 


Pro Pro Leu Glu Glu Phe Ser Pro Leu Asp Val Asn Lys Ala Asp Glu 
225 230 235 240 

He Phe Cys Gly Asp Met Arg Ala Leu 
245 

(2) INFORMATION FOR SEQ ID NO:40: 
(i) SEQUENCE CHARACTERISTICS 
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(A) LENGTH: 339 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 


Met Pro Ala Asn Asp Ser Pro Tor lie Asp Phe Asn Pro Argr Gly He 
15 io is 

Leu Arg Asn Ala His Ala Gin Val He Leu Ala Thr Ser Gly Leu Arg 
20 25 30 

Lys Ala Pbe Leu Lys Arg Thr His Lys Ser Tyr Leu Ser Thr Ala Gin 
35 40 45 

Trp Leu Glu Leu Asp Ala Gly Asn Gly Val Thr Leu Ala Gly Glu Leu 
50 55 60 

Asn Thr Ala Pro Ala Thr Ala Ser Ser Ser His Pro Ala Bis Lys Asn 
65 70 75 SO 

Thr Leu Val He Val Leu His Gly Trp Glu Gly Ser Ser Gin Ser Ala 
BS 90 $5 

Tyr Ala Thr Ser Ala Gly Ser Thr Leu Phe Asp Asn Gly Phe Asp Thr 
100 105 110 - 

Phe Arg Leu Asn Phe Arg Asp His Gly Asp Thr Tyr His Leu Asn Arg 
115 120 125 

Gly He Phe Asn Ser Ser Leu He Asp Glu Val Val Gly Ala Val Lys 
130 135 140 

Ala He Gin Gin Gin Thr Asp Tyr Asp Lys Tyr Cys Leu Met Gly Phe 
145 150 155 160 

Ser Leu Gly Gly Asn Phe Ala Leu Arg Val Ala Val Arg Glu Gin His 
165 170 175 

Leu Ala Lys Pro Leu Ala Gly Val Leu Ala Val Cys Pro Val Leu Asp 
180 185 190 

Pro Ala His Thr Met Met Ala Leu Asn Arg Gly Ala Phe Phe Tyr Gly 
195 200 205 

Arg Tyr Phe Ala His Lys Trp Lys Arg Ser Leu Thr Ala Lys Leu Ala 
210 215 220 225 

Ala Phe Pro Asp Tyr Lys Tyr Gly Lys Asp Leu Lys Ser He His Thr 
230 235 240 

Leu Asp Glu Leu Asn Asn Tyr Phe He Pro Arg Tyr Thr Gly Phe Asn 
245 250 255 
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Ser Val Ser Glu Tyr Pbo Lys Ser Tyr Thr Leu Tor Gly Gin Lye Leu 
250 255 270 

Ala Pbe Leu Asa Cys Pro Ser Tyr He Leu Ala Ala Gly Asp Asp Pro 
5 275 280 285 

He He Pro Ala Ser Asp Phe Gin Lys He Ala Lys Pro Ala Aan Leu 
290 295 300 305 

His He Tor Val Thr Gin Gin Gly Ser Hie Cys Ala Tyr Leu Glu Asn 
10 310 315 320 

Leu His Lys Pro Ser Ala Ala Asp Lys Tyr Ala Val Lys Leu Phe Gly 
32S 330 335 

Ala Cys 

15 


(2) INFORMATION FOR SEQ ID NO:41: 


(i) SEQUENCE CHARACTERISTICS 

20 

(A) LENGTH: 311 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

25 (ii) MOLECULE TYPE: PROTEIN 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 


30 


35 


40 
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Met Leu Asp Met Pro He Asp Pro Val Tyr Tyr Gin Leu Ala Glu Tyx 
15 10 15 

Phe Asp Ser Leu Pro Lys Phe Asp Gin Phe Ser Ser Ala Arg Glu Tyr 
20 25 30 

Arg Glu Ala He Asn Arg He Tyr Glu Glu Arg Asn Arg Gin Leu Ser 
35 40 45 

Gin His Glu Arg Val Glu Arg Val Glu Asp Arg Thr He Lys Gly Arg 

50 55 60 

Asn Gly Asp He Arg Val Arg Val Tyr Gin Gin Lys Pro Asp Ser Pro 

65 70 75 60 

Val Leu Val Tyr Tyr His Gly Gly Gly Phe Val He Cys Ser He Glu 

85 90 - ... . 95 

Ser His Asp Ala Leu Cys Arg Arg He Ala Arg Leu Ser Asn Ser Thr 
100 105 110 

Val Val Ser Val Asp Tyr Arg Leu Ala Pro Glu His Lya Phe Pro Ala 
115 120 X25 

Ala Val Tyr Asp Cys Tyr Asp Ala Thr Lys Trp Val Ala Glu Asn Ala 
130 135 140 

Glu Glu Leu Arg lie Asp Pro Ser Lys He Phe Val Gly Gly Asp Ser 
145 150 155 160 

Ala Gly Gly Asn Leu Ala Ala Ala Val Ser He Met Ala Arg Asp Ser 
165 170 175 

Gly Glu Asp Phe He Lys His Gin He Leu He Tyr Pro Val Val Asn 
180 185 190 

Phe Val Ala Pro Thr Pro Ser Leu Leu Glu Phe Gly Glu Gly Leu Trp 


195 200 205 

He Leu Asp Gin Lys He Met Ser Trp Phe Ser Glu Gin Tyr Phe Ser 
210 215 230 

Arg Glu Glu Asp Lys Phe Asn Pro Leu Ala Ser Val He Phe Ala Asp 
235 240 245 250 

Leu Glu Asn Leu Pro Pro Ala Leu He He Thr Ala Glu Tyr Asp Pro 
255 260 265 

Leu Arg Asp Glu Gly Glu Val Phe Gly Gin Met Leu Arg Arg Ala Gly 
270 275 200 

Val Glu Ala Ser He Val Arg Tyr Arg Gly Val Leu His Gly Phe He 
285 290 295 

Asn Tyr Tyr Pro Val Leu Lys Ala Ala Arg Asp Ala He Asn Gin He 
300 305 310 

Ala Ala Leu Leu Val Phe Asp 
315 320 


55 (2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS 


47 


EP 0 880 590 B1 


(A) LENGTH: 305 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEO ID NO:42: 


Met Pro Leu Asp Pro Arg He Lys Lys Leu Leu Glu Ser Ala Leu Thr 
5 10 15 

He Pro lie Gly Lys Ala Pro Val Glu Glu Val Arg Lys He Phe Arg 
20 25 30 

Gin Leu Ala Ser Ala Ala Pro Lys Val Glu Val Gly Lya Val Glu Asp 
35 40 45 

He Lys lie Pro Gly Ser Glu Thr Val He Aan Ala Arg Val Tyr Phe 
50 55 60 

Pro Lys Ser Ser Gly Pro Tyx Gly Val Leu Val Tyx Leu His Gly Gly 

65 70 75 0O 

Gly Phe val He Gly Asp Val Glu Ser Tyr Asp Pro Leu Cys Arg Ala 
85 90 95 

He Thr Asn Ala Cys Asn* Cys Val Val Val Ser Val Asp Tyr Arg Leu 
100 105 no 

Ala Pro Glu Tyr Lys Phe Pro Ser Ala Val He Asp Ser Phe Asp Ala 
115 120 125 

Thr Asn Trp Val Tyr Asn Asn Leu Asp Lys Phe Asp Gly Lys Met Gly 
130 135 140 

Val Ala He Ala Gly Asp Ser Ale Gly Gly Asn Leu Ala Ala Val Val 
145 150 155 160 
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Ala Leu Leu Ser Lys Gly Lys lie Asn Leu Lya Tyr Gin lie Leu Val 
1« 170 175 

Tyr Pro Ala Val Ser Leu Asp Asn Val Ser Arg Ser Met lie Glu Tyr 
180 185 190 

Ser Asp Gly Phe Phe Leu Thr Arg Glu His lie Glu Trp Phe Gly Ser 
195 200 205 

Gin Tyr Leu Arg Ser Pro Ala Asp Leu Leu Asp Phe Arg Phe Ser Pro 
210 215 220 

lie Leu Ala Gin Asp Phe Asn Gly Leu Pro Pro Ala Leu He He Thr 
225 230 21S 240 

Ala Glu Tyr Asp Pro Leu Arg Asp Gin Gly Glu Ala Tyr Ala Asn Lys 
245 250 255 

Leu Leu Gin Ala Gly Val Ser Val Thr Ser Val Arg Phe Asn Asn Val 
2S0 265 270 

lie His Gly Phe Leu Ser Phe Phe Pro Leu Met Glu Gin Gly Arg Asp 
275 2B0 265 

Ala He Gly Leu He Gly Ser Val Leu Arg Arg Val Phe Tyr Asp Lys 
290 295 300 

Xle 
305 


(2) INFORMATION FOR SEQ ID NO:43: 


(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 605 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 


49 


EP 0 880 590 B1 


ATG AAO GTT AAA CAC GTT ATT GTT TEA CAT CGC TTA TAT ATG TCT GGC 

Met Lys val Lys His Val lie val Leu His Gly Leu Tyr Met Ser Oly 
as 10 is 

TTG GTQ ATG CGC COO TTA TCT TCO CGT CTA GAA GA8 TOO GGG GTT AAA 
Leu val Met Arg Pro Leu Cys ser Arg Leu Olu Olu ser Oly Val Lys 
20 2S 30 

GTT TTA AAC TTA ACC TAC AAT ACT CCA GAC CCT AAT CGA GAT GCT ATT 
Val Leu Asn Leu Thr Tyr Asn Thr Arg Asp Pro Asn Arg Asp Ala lie 
35 40 45 

TTT ACQ GAA ATA GAT GAG TTT ATT ACC AAT GAG OCT TCT GCT TTA OTG 
Pbe Thr Gin He Asp Glu Phe He Ser Asn Glu pro Ser Ala Leu Val 

50 55 60 

TGT CAC TCT ATG GGG GGC TTA GTT GCT CGC GCC TAT TTA GAG GCA AAC 
Cys Hia ser Met Gly Gly Leu Val Ala Arg Ala Tyr Leu Glu Ala Aan 
65 70 75 80 

TGA CC0 CCA ACT CAT CAT GTT GAA AAG OTA ATC ACC TTA GGA ACQ CCA 
Ser Ala Pro Ser Hia Bio Val Glu Lys Val He Thr Leu Gly Thr Pro 
65 a 90 95 

CAT ACT GGC AOC CAT ATT GCT GAA AAA ATG GA0 GAA AAA GGG TTC GAG 
His Thr Gly Ser His He Ala Glu Lys Met Gin Gin Lya Gly Phe Glu 

100 105 HO 


CTA TTA TTA AAA AAT AGC GTT GAG TTT TTA CTC TCT AAG AAT GOT GAT 
Leu Leu Leu Lys Asn ser Val Glu Phe Leu Leu Ser Lys Asn Gly Asp 
115 120 125 

TOO CCT TTT AAA GCC AAG CTA TAT AGC ATT GCC GGC GAC TTA CCO ATT 
Trp Pro Phe Lys Ala Lys Leu Tyr Ser He Ala Gly Asp Leu Pro He 
130 135 140 

GGC TTA ATG CCA CTC ATT GTA AAA GGC ACC CGC TCT GAT GGC ACT CTA 
Gly Leu Mac Pro Leu He Val Lya Gly Ser Arg Ser Asp Oly Thr Val 
145 150 155 160 

TTG CTA GAT GAA ACC AAG CTA AAG GOT ATO GCT GAA CAC AAG GTG TTT 
Leu Leu Asp Glu Thr Lys Leu Lya Gly Mat Ala Glu Bis Lya Val Phe 
165 170 175 

CAT TTA AGC CAT ACA ACT ATG ATT TAC TCT O0C GAA GTC GTT AAT TAT 
Hia Leu Ser Hia Thr Ser Met He Tyr Ser Arg Gin Val Val Aan Tyr 
180 185 190 

ATT CTT GAG CGC TTG AAC GAG GAC ATT TA 
He Leu Glu Arg Leu Asn Glu Asp He 
195 200 


(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 779 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
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ATO ATA AAA AAC TTC QAC* AGA GAA AAT TCT AGC TTA OTA CTO TCC OCT 
Met lie Lys Asn Phe Asp Arg Olu Asn Ser Sex Leu Val Leu Sex Oly 
1 S 10 is 

GOT GOT OCT CTG GOT ATT GCT CAC TTG GOT OTA CTG CAT GAC CTT GAA 
Oly Gly Ala Leu Gly He Ala His Leu Gly Val Leu His Asp Leu Glu 
20 25 30 

AAA CAA AAT ATT OTA CCA AAT GAA ATT OTT GOT ACA ACT ATO GOT OGT 
Lys Oln Asn lie Val Pro Asn Olu He Val Oly Thr Ser Met Oly Gly 
35 40 45 

ATC ATT GOT GCA TCT ATO OCT ATC GOO ATO AAA GAG AAA GAA ATA CTC 
He He Gly Ala Ser Met Ala He Gly Met Lys Glu Lys Olu He Leu 
50 • 55 60 

GAA GAA ATC AAA AAC TTT TCC AAT GTC TTC AAC TOO ATA AAA TTC TCT 
Glu Glu He Lys Asn Phe Ser Asn val Phe Asn Tip He Lys Phe Ser 

65 70 75 80 

TTT TCC OOT AAT TCT CTT GTC GAT AAC GAG AAO ATC GCT AAG ATA TTT 
Phe £er Oly Asn Ser Val Val Asp Asn Glu Lys He Ala Lys He Phe 
85 90 95 

GAT ACT CTT TTT AAA QAC AQA AAG ATO ACA GAT ACQ GIG ATC CCT CTT 
Asp Thr Leu Phe Lys Asp Arg Lys Met Thr Asp Thr Val He Pro Leu 
100 105 110 

AAA CTC ATC OCT ACA AAC TTA CAT AAT GGA CAT AAA AAA GTA TTT ACT 
Lys Leu He Ala Thr Asn Leu His Asn Gly His Lys Lys Val Phe Thr 
115 120 JL25 

GCT TOO GAT GAT GTA CTO ATC AAA GAT GCA ATA CTC TCA ACA ATO OCA 
Ala Ser Asp Asp val Leu He Lys Asp Ala He Leu Ser Thr Met Ala 
130 135 140 

ATA CCC GOT GTA TTT GAA GAA CAT ATT ATT GAT GOT GAA ACC TAT GGC 
He Pro Oly val Phe Glu Glu His He He Asp Gly Glu Thr Tyr Oly 
145 150 155 160 


GAC GOT TTT CTT TCT GAA AAC CTT GOT GTO AAT GAG GCA ACA TTC AAT 
Asp Gly Phe Leu Cys Glu Asn Leu Gly Val Asn Glu Ala Thr Phe Asn 
16$ 170 175 

GAT GTT TTA GCT GTA OAT CTC ATO GOT GAG AAC TCT TTT GAA AAA GCA 
Asp Val Leu Ala Val Asp Val Met Oly Olu Asn Ser Phe Olu Lys Ala 
160 185 190 

ATO CCC GAC AAC TTC TTT AAA ACA TCA AAT OTT TTA GAA ATO TTT GAA 
Met Pro Asp Asn Phe Phe Lys Thr Ser Asn Val Leu Glu Met Phe Olu 
195 200 20$ 

AAA TCA ATO CGA CTT TTT ATT TAC AAC CA0 ACA CAO ACA CAT ATT AAA 
Lys Ser Met Arg Leu Phe He Tyr Asn Oln Thr Oln Thr His He Lys 
210 215 220 

AAT GCA AAT AAA AAT ATT TAT CTT ATT GAA CCC OTT ACC AAA GAG TAT 
Asn Ala Asn Lys Asn He Tyr Leu He Glu pro Val Thr Lys Glu Tyr 
225 230 235 240 

AAA ACA TTT CAA TTT CAT AAA CAT AAA GAG ATA CCT GCT TTA GGC TIG 
Lys Thr Phe Gin Phe His Lys His Lys Olu He Arg Ala Leu Gly Leu 
245 250 255 

GOT TTA CTC TO 
Gly Leu Leu 


(2) INFORMATION FOR SEQ ID NO:45: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 905 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 
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(ii) MOLECULE TYPE: GENOMIC DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 


ATO CCC CTA CAT CCA AAO QTA AAO AAA TTA CTT TCC CAO CTA CCT CCC 48 
Met Pro Lou Bis Pro Lys Val Lye Lyo Leu Leu Ser Gin Leu Pro Pro 
15 10 15 

C3U5 GAC TTC TCC AOA AAC GIG CAG GAC CIG AGG AAO GCC TGG GAT TTA 96 
Gin Asp Phe Sex Arg Aon Val Gin Asp Leu Arg Lyo Ala Trp Asp Leu 
10 20 25 30 

CCC TTC TCA GGG AGG AGG GAG ACC CTG AAO AGG GTT. GAG GAC CTT GAG 144 
Pro Phe Ser Gly Arg Arg Olu Tor Leu Lys Arg val Glu Asp Leu Glu 
35 40 45 

ATA CCC ACT AGG GAC GCA CGA ATC AGG GCC AGG GTC TAG ACC CCC TCA * 192 " 
15 lie Pro Thr Arg Asp Ala Arg lie Arg Ala Arg Val Tyr Thr Pro Ser 

50 55 60 

AGT AAO GAA AAC TTA CCC GTC CTT GTT TAC TAT CAC GGC GOT GOC TTC 240 
Ser Lys Glu Asn Leu Pro Val Leu Val Tyr Tyr His Gly Gly Gly Phe 
65 70 75 80 

GTC TTC GOT AGC GTT GAC AGC TAC GAC GGC CTC GCA TCC CTT ATT GCC 288 
20 val Phe Gly Ser V&l Asp Ser Tyr Aap Gly Leu Ala Ser Leu lie Ala 

85 90 95 

AAG GAA TCP GGG ATT GOO GTT ATC TCC CTC GAG TAT AGG CTC GCC CCT 336 
Lys Glu Ser Gly He Ala Val He Ser Val Glu Tyr Arg Leu Ala Pro 
100 105 110 

25 GAG CAC AAG TTC CCC ACC CCA GTC AAC GAC TOO TGG GAT GCO CTT CTC 384 

Glu Bis Lys Phe Pro Thr Ala Val Asn Asp Ser Trp Asp Ala Leu Leu 
115 120 125 

TGG ATC GOO GAG AAC GGA GGC AAG CTG GGG CTC GAC ACC TOG AGA CTT 432 
Trp He Ala Glu Asn Gly Gly Lys Leu Gly Leu Asp Thr Ser Arg Leu 
130 135 140 

30 GCC CTG GCT GGG GAT AGT GCT GGA GGA AAC CTG TCT GCC GTG GTG TCC 480 

. Ala Val Ala Gly Asp Ser Ala Gly Gly Asn Leu Ser Ala Val Val Ser 
14S 150 155 160 


35 


40 


45 


50 


55 


52 
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CTC CTG GAC AGO GAC CAO GST AAG GGA CTO GTT ACT TAT CAO OTC CTA 528 
Leu Leu Asp Arg Asp oln Oly Lys Oly Leu Vol Ser Tyr Gin VaX Leu 
165 170 175 

ATC TAC CCA OCA GTO AAC ATO OTC GAT AAC TCC CCA TCC OTC AOO OAO 576 
lie Tyr Pro Ala Val Asn Met Val Asp Asa Ser Pro Ser Val Arg Glu 
180 185 190 

TAC GOC OAO GGA TAC TTC CTC ACC AGO TCC ATG ATO AAC TOO TTC OOQ 624 

Tyr Gly Glu Oly Tyr Phe Leu Thr Arg ser Ket Net Asa Trp Phe Oly 
195 200 205 

ACC ATO TAC TTC TCC TCT OGA AGG OAA GCQ OTA TCC CCC TAC GCC TCT 672 

Thr Net Tyr Phe ser Ser Gly Arg Olu Ale v&l Ser Pro Tyr Ale Ser 
210 215 220 

CCA GCC TTO OCT GAC CTA CAT AAC CTC CCA CCC TCA CTO OTO ATC ACT 730 
Pro Ale Leu Ala Asp Leu Bis Asn Leu Pro Pro Ser Leu Val lie Thr 
225 230 235 240 

OCA GAG TAT GAT CCC CIA AGO GAT £AO GGA OAO ACC TAC TCT CAC TCC 768 
Ale Olu Tyr Asp Pro Leu Arg Asp Gin Oly Olu Thr Tyr Ser His Ser 
345 350 255 

CTA AAC OAO GCT GGA AAC OTA TCA ACC TTO GTT AGA TAT GAA GGA ATO 616 
Leu Asn Olu Ala Oly Asn val Ser Thr Leu Val Arg Tyr Oln Oly Met 
20 2 *° 265 270 

ATT CAC GGC TTC CTG TCC TTC TAC GAO TOO ATA ACT GCC GOT AAA CTA 864 
tie His Oly Phe Leu Ser Phe Tyr Olu Trp He Thr Ala Oly Lys Leu 
275 380 285 


10 


15 


25 


30 


GCC ATT CAC CAC ATT GCT OGO GTT CTG AGA TCT GTC CTT TA 905 
Ala He His His He Ala oly Val Lue Arg Ser Val Leu Arg Ser Val 
290 295 300 

Leu 301 


(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 978 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

35 (C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 


45 


50 


55 


53 


EP 0 880 590 B1 


10 


OTO GCC TTC TTC GAT ATO CCC CTT GAG GAA CTG AAA AAG TAG CGG CCT 48 
Val Ala Phe Phe Asp Met Pro Leu Glu Glu . Leu Lys Lys Tyr Arg Pro 

1 '5 id" 13 

GAA AGO TAG GAG GAG AAA GAT TTC CAT GAG TTC TGG AGG GAA AGA CTT 96 
Glu Arg Tyr Glu Glu Lys Asp Phe Asp Glu phe Trp Arg Glu Tte Leu 
20 25 30 

AAA GAA AGC GAA GGA TTC CCT CTG GAT CCC GTC TTT GAA AAG GTO GAC 144 
Lys Glu Ser Glu Gly phe Pro Leu Asp Pro Val Phe Olu Lys Val Asp 
35 40 45 

TTT CAT CTC AAA ACQ GTT GAA ACG TAC GAT GTT ACT TTC TCT GGA TAC 192 
Phe His Leu Lys Thr Val Glu Thr Tyr Asp Val Thr Phe Ser Gly Tyr 
50 55 60 

AGG GGG GAG AGA ATA AAG GGC TGG CTT CTT GTT COG AAG TTO GCO GAA 240 
Arg Gly Gin Arg Zle Lyo Gly Trp Leu Leu val Pro Lys Leu Ala Glu 
65 70 75 80 

15 GAA AAG CTT CCA TCC GTC GTG CAG TAC ATA GOT TAC AAT GOT GGA AGG 280 

Glu Lye Leu Pro Cys Val Val Gin Tyr Zle Gly Tyr Aan Gly Gly Arg 
85 90 95 

GGT TTT CCA CAC GAC TGG CTG TTC TGG CCC TCA ATG GOT TAC ATC TGT 336 
Gly Phe Pro His Asp Trp Leu Phe Trp Pro Ser Het Gly Tyr He Cys 
100 105 no 

20 


25 


30 


35 


40 


45 


50 


55 


54 
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rrr gtc ato gac acc ago cgg gag gga agc gqc tgg atg aao gga gac 384 

Phe Val* Met Asp Thr Arg Gly Gin Gly Ser Gly Trp Her Lys Oly Asp 
X1S 120 125 

ACA COO GAT TAC CCT QAO GOT OCA GTC GAT CCA CAG TAC CCC GGA TTC 432 
Thr pro Asp Tyr Pro Glu Oly Pro Val Asp pro Gin Tyr Pro Oly Phe 
130 135 140 

AUG ACQ AGO GOC ATT CTG GAT COG GGA ACC TAT TAC TAC AGO CGA GTC 480 
Net Thr Arg Oly lie Leu Asp Pro Oly Thr Tyr Tyr Tyr Arg Arg val 
145 150 155 160 

TTC OTO GAT GCG GTC AGO GOO GT9 GAA GGA GCC ATT TCC TTC CCG AGA 520 
Phe Val Asp Ala val Arg Ala Val Glu Ala Ala He ser Phe Pro Arg 
165 no 17S 

GTG GAT TCC AGO AAG GTG GTO GTO GCC GGA GGC ACT CAG GOT GGG GGA 576 
Val Asp Ser Arg Lys Val Val Val Ala Gly Gly Ser Gin Gly Oly oly 
180 185 190 

ATC CCC CTT GCG GTG ACT GCC CTG TOO AAC AGG GTG AAO OCT CTG CTC 624 
zie Pro Leu Ala Val Ser Ala Leu Ser Asn Arg Val Lys Ala Leu Leu 
155 200 205 

TGC GAT GTG CCG TTT CTG TGC CAC TTC AGA AGO GCC GTG CAA CTT GTC 672 
- Cys Asp Val Pro Phe Leu Cys His Phe Arg Arg Ala Val Ola Leu Val 
210 215 220 

GAC ACA CAC CCA TAC GTG GAG ATC ACC AAC TTC CTC AAA ACC CAC AGO 720 
Asp Thr His pro Tyr Val Glu lie Thr Asn Phe Leu Lys Thr His Arg 
225 230 235 240 

GAC AAA GAG GAG ATT GTT TTC AGA ACA CTT TCC TAC TTC GAT GGT GTG 76B 
Asp Lys Glu Glu lie Val Phe Arg Thr Leu Ser Tyr Phe Asp Gly Val 
245 250 255 

AAC TTT GCA GCA AGO GCA AAG GTG CCC GCC CTG TTT TCC GTT GGG CTC S16 
Asn Phe Ala Ala Arg Ala Lys Val Pro Ala Leu Phe Ser Val Gly Leu 
260 265 370 

ATG GAC ACC ATC TGT CCT CCC TOO ACQ GTC TTC GCC OCT TAC AAC CAC 664 
Met Asp Thr xie Cys Pro Pro ser Thr Val Phe Ala Ala Tyr Asa His 

30 275 280 285 

TAC GCC GGT CCA AAO GAG ATC AGA ATC TAT CCG TAC AAC AAC CAC GAA 912 
Tyr Ala Gly Pro Lys Glu lie Arg He Tyr Pro Tyr Asn Asn His Glu 
290 295 300 


10 


15 


20 


25 


35 


40 


GGT GGA GGT TCT TTC CAG GCA ATT GAG CAG GTG AAA TTC TTO AAG AGA 960 
Gly Gly Gly Ser Phe Gin Ala He Glu Gin Val Lys Phe Leu Lys Arg 
305 310 315 320 

CTA TTT GAG GAA GOC TAG 978 
Leu Phe Qlu Glu Gly 
325 


(2) INFORMATION FOR SEQ ID NO:47: 
(i) SEQUENCE CHARACTERISTICS 

45 (A) LENGTH: 879 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

50 (ii) MOLECULE TYPE: GENOMIC DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 


55 
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56 


(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 914 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 


EP 0 880 590 B1 


15 


30 


35 


45 


(Xl) SEQUENCE DESCRIPTION: SEQ-ID KO:48: 

ATO AGC AAA TTC CCA ATA CTC TOO GCO TTQ ATA ACO OCA TAC CTO CCO 40 
Met Ser Lya Phe Ala lie lieu Trp Ala Leu He Thr Ala Tyr Leu Pro 

5 1 5 10 * 15 

OAA CCT CTC ATG AAA CTO GXA TAT TTA OOQ CGO CGC GAA ACO CTT GCG 94 
Olu Pro Val Met Lya Leu Val Tyr Leu Oly Arg Arg Clu Thr Leu Cly 
30 25 . _ jo 

Sf* 25° 5" P 10 F 1 ? *** ^ 010 000 m GCC AAT 144 

Ala Arg Thr Leu Asp Val Lya Ala Gin Ala Val Oly Arg Leu Ala Asn 
10 35 40 45 

GCA ACA ASA CCT GTC CGC CTO ATT CCO ACO GTC OAC OAA AGC CGC AAG 193 
Ala Thr Arg Pro Val oly Val lie Pro Thr Val Olu Clu Ser Arg Lys 
50 55 60 

ATO ACO CAT AAA GCC CTT AGC CTT TTT GAT CAO CCC CCC CCC OAA TTA 240 
Met Thr Aop Lys Ala Val Ser Leu Phe Asp Ola Pro Ala Pro Olu Leu 
65 70 75 80 

TTC CCT AAA AAA OAC ATT CAO ATT OAC CGC CCT GAA GCG CCT ATT CAT 280 
Phe Arg Lys Lye Asp lie Gin He Asp Oly Ala Olu Gly Pro He Asp 
05 90 95 

20 CCC CCT ATT TAC AGC CCC CCT CCA AAA CAT CCC CCO CBn CCA ATA CIA 336 

Ala Arg lie Tyr ser Cly Pro Ala Lys His Arg Pro Arg pro lie Leu 
100 105 no 

GTC TAT TTT CAC GCC GGT CGC TGG OTT CAO GCC AAT CTO GAC AGC CAT 304 
val Tyr Phe Bis Oly Cly Oly Trp Val Oln Oly Aim Leu Asp Ser His 

115 120 125 


OAC GCO CTT TGC GCC AAG CTO GCA AAA TOO CCC AAC TOC ATT GTT ATC 432 
Asp Gly val Cys Gly Lya Leu Ala Lys Trp Ala Aon cys He Val He 
130 135 140 

TOO GTC GAT TAT COT CTA GOG CCC GAA CAC AAA TTT CCT TGT CCO CCO 480 
Ser Val Asp Tyr Arg Leu Ala Pro Clu His Lya Phe Pro Cys Ala Pro 
145 ISO 155 160 

CTT GAT GCO ATT GCO GCC TAT AAA. TGG GTC CGC GCC AAC GCA ACA AAC 528 
Leu Asp Ala He Ala Ala Tyr Lya Trp Val Arg Ala Asn Ala Thr Asa 

165 170 175 

CTT GCC GCC GAT CCT GAA CGT ATC GGC GTT GCC CCC GAT AGC GCA GGO 576 
Leu Gly Gly Asp Pro Clu Arg He Gly Val Cly Oly Asp Ser Ala Oly 
1B0 185 190 

GGC AAT CTT CCC GCC GTT GTC TGC CAA CAA ACC CCC ATO AAC GGC 0A0 624 
Oly Asn Leu Ala Ala Val Val Cys Oln Oln Thr Ala Met Asn Oly Glu 
195 200 205 

- CGC ACA CCA GAT CTO CAA GTC CTO ATC TAT CCO GCO CTO GAT CCA CCC 672 
Arg Thr Pro Asp Leu Gin Val Leu lie Tyr Pro Ala Leu Asp Ala Are 
40 210 215 220 

ATG ATC TCG ACC TOG ATC GAC CAA TTC CGT CAT GCC TAC ATC TTO CCO 720 
Met He Ser Thr Ser Met Clu Glu Leu Arg Asp Ala Tyr He Leu Pro 
225 230 235 340 


AAA TCC AGA ATO GAG TAT TTC CTC GCC CTA TAT ACO COT CGC CCT CAC 768 
Lya Ser- Arg Met Glu Tyr Phe Leu Gly Leu Tyr Thr Arg Oly Pro Asp 
245 2S0 255 

GAT ATC GAG GAC CTT AGC ATG TCG CCA ATT CTC AGG GAT ACC CTC CCO 816 
Asp He Clu Aap Leu Arg Met Ser Pro He Leu Arg Asp Thr Val Ala 
260 26S 270 

GAT CAA CCC CAA GCC TOC ATT CTC ACC TGT GCC TTT GAC CCT GCO COA 864 
50 Asp Oln Pro Oln Ala Cys He Val Thr Cys Gly Phe Asp Pro Ala Arg 

275 280 285 

COA CGO GAA CAC CTA CGC GCA ACG CTT AAT TCC CGA GCG GAT ACA CCT 913 
Arg Arg Clu His Leu Arg Arg Thr Leu Asn Cys Arg Cly Asp Arg Arg 

290 29S 300 . 

55 171 S14 


(2) INFORMATION FOR SEQ ID NO:49: 


58 
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(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 926 NDCLEOTIDES 

(B) TYPE: NUCLSIC ACID 

5 (C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 


GTO AGC ATT GOT CTG CGA CIO TTA AAC TGO TTT TIG AAT ACC TTT GAA 48 
Val Ser He Arg Leu Arg Leu Leu Asa Trp Phe Leu Asn Thr Phe Olu 
15 10 is 

/5 

AAA CCA AAA, CIO OCC 000 GCC AAA ACQ COO OAT G&T TTO CGA AAA TOO 9S 
Lys Pro Lys Leu Ala Ala Ala Lys Thr Pro Asp Asp Leu Arg Lys Ser 
20 25 SO 

TTT GAA TTA AAO OCO A6G TTT TIQ TTT 000 OCO CCA COT AAA ACA AGO 144 
Phe Olu Leu Lys Ala Arg Pbe Leu Phe Pro Ala Pro Arg Lys Thr Arg 

2Q 55 40 45 " 

TTT ACT CAT OAT GTA TTO CAO TCA OGC ATC GGO TOG OTA AAT GCC CAG 192 
Phe Ser Ria Asp Val Leu Gin Ser Oly He Oly Ser val Asn Ala Gin 

50 55 €0 

TOO GCO AAA TCC AAA TCT OCA TCT GAT GAC AGO OTA ATC CTG TAT TTT 240 

Trp Ala Lys Ser Lys Ser Ala Ser Asp Asp Arg Val He Leu Tyr Phe 
25 65 70 75 60 

CAT GGG GGA GGG TAT OTT TTT GGG TCA CCA AAA AOC CAC COT GCA ATO 288 
His Gly Gly Gly Tyr Val Phe Oly Ser pro Lys Thr Hie Arg Ala Met 
85 90 95 

TTO GCC CGC TTG TCG CCA ATO ACA GOT CTT TCT GOB TGC CTT CCA GAT 336 
30 Leu Ala Arg Leu Ser Ala Met Thr Gly Leu ser Ala Cys Leu Pro Asp 

100 105 110 

TAT AGO TTG GCA CCA GAG CAC CCA TTT CCA GCC OCG ATC GAA GAT GCA 384 
Tyr Arg Leu Ala Pro Glu His Pro Phe Pro Ala Ala He Glu Asp Ala 
115 120 125 

OTT TTA TCG TAT AAA TOT TTA CTA GAG CGA CCA ATC GAG OCC GAA AAT 432 
35 Val Leu Ser Tyr Lys Cys Leu Leu Glu Arg Ala He Glu Pro Gin Asn 

130 135 140 

ATT ATA CTG GGG GOG GAC AOT OCT GGT OGC GOT TTO OTT CTT OCT TTG 480 
He He Leu Gly Gly Asp Ser Ala Gly Gly Oly Leu Val Leu Ala Leu 

145 150 155 140 

40 CTT GCA GAA ATC AAG GCC CAA TCC TTO CCC AAA OCT GCT OGC GTT TTT 528 

Leu Ala Glu He Lys Ala Oln Ser Leu Pro Lys Pro Ala Oly Val Phe 
165 170 175 

GCC TTO TCG CCT TTO GTT GAT TTA TCA TTT TOG GGC CTT TCG TTT TCT 57S 
Ala Leu ser Pro Leu Val Asp Leu Ser Phe Ser Gly Leu ser Phe Ser 
180 185 190 

45 AAA AAT GCC CAA ACC GAT GTG ATG TTG CCC GCA TCA CGO GCT GOO OAT €24 

Lys Asn Ala Oln Thr Asp Val Met Leu Pro Ala Ser Arg Ala Ala Asp 
195 200 20S 

ATO CCO ACC TTG TAT TTO GAT GGG GCC GAT GCA GAT GAT CCA COT GCA 672 

Met Ala Thr Leu Tyr Leu Asp Gly Ala Asp Ala Asp Asp Pro Arg Ala 

210 215 220 

50 TCG CCO CTG CAO OCG GAT TTT TCT CCC ATG CCO CCT OTA TTT CTG ACA 720 

Ser Pro Leu Gin Ala Aep Phe ser Gly Met Pro Pro val phe Leu Thr 
225 230 235 240 

GCA ACT GAC ACT GAA ATC CTG TTG GAT GAT TOO CTG OGG ATG GOG GAT 768 
Ala Ser Asp Ser Glu He Leu Leu Asp Asp Cys Leu Arg Met Ala Asp 
245 250 255 

55 

CAC TTG COT GCG CAA GOT OTC GTT GTG ACA GAC COG ATT GTT GAA AAC 816 
His Leu Arg Ala Gin Oly Val val val Thr Asp Arg He val Glu Asn 
260 265 270 
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10 


30 


35 


40 


45 


50 


55 


CAT CCA CAT GTT TGG CAT ATT TXT CAA OGC CTT CTA CCC 6AA GGA GAT 664 
Hie Pro Bis Val Trp His zle Foe Gin Arg Leu Leu pro Glu Ala Aep 
275 380 285 

CAO GGG CTQ CGO GOG ATT GCC GOG TGG ATT AAA CCT CTT TEA TCA GGT 912 
aln Gly Leu Arg Ala He Ala Ala Trp Zle Lye Pro Leu Leu Ser Gly 
290 295 300 

TCA AAC GAA AGC TA 926 

Ser Aon Glu Ser 

305 


(2) INFORMATION FOR SEQ ID NO:50: 
(i) SEQUENCE CHARACTERISTICS 

15 (A) LENGTH: 713 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

20 (ii) MOLECULE TYPE: GENOMIC DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 


25 


60 
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ATC CTT AC* TTT AAT GTT TTA TAT GOT AT3 ATQ AAA CAA AAA CTA GCA 48 
K6C Leu Thr Phe Asn Val Leu Tyr Oly Ken Mec Lys Gin x.ya I*iu Ala 
» 5 10 is 

OCA ATT CTC ATQ TTT TTA GGG CTA TGA GCA GCA GAG GCT CAA OAC TGG 96 
5 Ala lie Leu Met Phe Leu Gly Leu Ser Ala Ala Glu Ala Gin Asp Trp 

*° as 30 

CCT GAC CTA CAG AAA TAT OST AOT GCT AAT AAA GAA GCC AAA TTA CTT 144 
Pro Asp Leu Gin Lys Tyr Arg Ser Ala Ann Lys Glu Ala Lys Leu Leu 
35 40 45 

10 CCA AAO GAA AAC CGQ AAG GTG GTT TTT ATQ GGC AAC TCC ATT ACA GAA 152 

Pro Lys Glu Asn Arg Lys Val Val Phe Het Oly Asn Ser Xle Thr Glu 
SO 55 $0 

GCC TGG ATT AOT CAG CGA CCT GAG TTT TTT ACT GAA AAT GGG TTT ATC 240 
Ala Trp tie Ser Gin Arg Pro Glu Phe Phe Ser Glu Asn Gly Pise Zle 
^6S 70 75 30 

15 GGT CGA GGC ATC AOT GGC CAO ACA ACC CCT CMS ATT) TTG TTO AGA TEC 280 

Gly Arg Gly He Ser Gly Gin Thr Thr Pro Gin Met Leu Leu Arg Phe 
85 _ 90 95 

CGA CAO GAT GTG ATA GAC CTG CAG CCA AAG CCT GTA GTG ATA CTA GCT 335 
Arg Gin Asp val Zle Asp Leu Gin Pro Lys Ala Val val Xle I^u Ala 
100 105 110 

20 . 

GGT ACC AAT GAC GTA OCT CAA AAT ACC GGG CCG ATG ACC ATT GAG GAA 384 
Gly Thr Asn Asp Val Ala Gin Asn Thr Gly Pro Het Thr Xle Glu Glu 
115 120 125 

TCG CTT GCT AAC ATT AAG TCT ATC GTG GAG CTG GCG CAA GCC AAT GGG 432 
Ser Leu Ala Asn Xle Lys Ser Mec Val Glu Leu Ala Gin Ala Asn Gly 
25 130 135 140 

ATC ACQ CCT GTT TTG TGT ACC GTG CTG CCT GCA GAT CGT TTC AGC TGG 480 
Xle Thr Pro Val Leu Cys Thr val Leu Pro Ala Asp Arg Phe Ser Trp 
145 150 155 160 

CGA CCT GAG CTT ACA CCC GCA GAA ACT ATC ATT GCC CTC AAT CAG CTC 528 
Arg Pro Glu Leu Thr Pro Ala Glu Thr Xle Xle Ala Leu Asn Gin Leu 

30 165 170 175 

ATT AAG CAA TAT GCC GAG GCA CAG GGC CTG GCC CTG GTG GAT TAT CAT 576 
lie Lys Gin Tyr Ala Glu Ala Gin Gly Leu Ala Leu Val Asp Tyr His 
180 185 190 

GCT GCA CTC ACC AAT AAA GGT GGA GGA CTT COG GTG AAA TAC GGA GAA 624 
oc Ala Ala Leu Thr Asn Lys Gly Gly Gly Leu Pro val Lys Tyr Gly Glu 

195 200 205 


OAT GGT GTG CAT CCA AAT GTA GCA GGC TAT CAG GTG ATG GAA AAC ATT 672 
Asp Gly Val His Pro Asn Val Ala Gly Tyr Gin Val Met Glu Asn Xle 
210 215 220 

GTT TTA CCG GTC ATT TCC AGC GAG TTG GCA AAG CTG AAG TA 713 
45 Val Leu Pro Val lie Ser Ser Glu Leu Ala Lye Leu Lys 

225 230 235 


(2) INFORMATION POR SEQ ID NO:51: 
50 (i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 978 NOCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 
55 (D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 
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10 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 


ATO OCC TTC TTC GAT TTA CCA CTC OAA GAA CTG AAA AAA TAT COT CCA 49 
Met Ala Phe Phe Asp Leu Pro Leu Glu Glu Leu Lys Lys Tyr Arg Pro 
15 10 is 

GAG CGG XAC GAA GAG AAA GAC TTC GAT GAG TTC TGG GAA GAG ACA CTC 96 
Glu Arg Tyr GXu Glu Lya Asp Phe Asp Glu Phe Txp Olu Olu Thr Leu 
20 25 30 

OCA GAG AGC GAA AAG TTC CCC TTA GAC OCC (TTC TTC GAG AGO ATO GAG 144 
Ala Glu Ser Glu Lys Phe Pro Leu Asp Pro Val Pfce Glu Arg Met Glu 
35 40 45 

TCT CAC CTC AAA ACA GTC GAA GOG TAC GAT GTC ACC TTC TCC GGA TAC 192 
Ser His Leu Lya Thr Val Glu Ala Tyr Aop Val Thr Phe Ser Gly Tyr 
50 55 60 

15 AGO GGA GAG AGO ATC AAA GOG TGG CTC CTT GTT OCA AAA CTO GAA GAA 240 

Arg Gly Gin Arg He Lys Gly Trp Leu Leu Val Pro Lya Leu Glu Glu 
65 70 75 90 

GAA AAA CTT CCC TCC GTT GTG CAG TAC ATA GGA TAC AAC GOT GGA AGA 208 
Glu Lys Leu Pro Cys Val Val Gin Tyr He Gly Tyr Asn Gly Gly Arg 
85 90 95 

20 

GGA TTC CCT CAC GAC TGG CTG TTC TGG CCT TCT ATO GGT TAC ATA TOT 336 
Gly Phe' Pro Uis Asp Trp Leu Phe Trp Pro Ser Met Gly Tyr Zle Cys 
100 105 120 

TTC GTC ATO* GAT ACT CGA GOT CAG GGA AGC OGC TGG CTG AAA GGA GAC 384 
Phe val Met Asp Thr Arg Gly oln Gly Ser Gly Trp Leu Lys Gly Asp 
115 120 125 

ACA COG GAT TAC CCT GAG GGT CCC GTT GAC CCT CAG TAT CCA GGA TTC 432 
Thr Pro Asp Tyr Pro Olu oly Pro val Asp Pro Gin Tyr Pro Gly Phe 
130 135 1^0 

ATO ACA AGA GGA ATA CTG GAT CCC AGA ACT TAC TAC TAC AGA CGA GTC 480 
Met Thar Arg Gly He Leu Asp Pro Arg Thr Tyr Tyr Tyr Arg Arg Val 
145 150 155 160 

TTC ACQ GAC OCT GTC AGA GCC GTT GAA CCT OCT CCT TCT TTT CCT CAG 528 
Phe Thr Asp Ala Val Arg Ala Val Glu Ala Ala Ala Ser Phe Pro Gin 
165 170 175 

GXA GAT CAA GAA AGA ATC CTO ATA CCT CGA GGC AST CAG GGT GGC COA 576 
Val Asp Gin Glu Arg He Val He Ala Gly Gly Ser Gin Gly Gly Gly 
35 180 185 190 

ATA GCC CTT GCG GTG AGC OCT CTC TCA AAG AAA GCA AAG GCT CTT CTG 624 
He Ala Leu Ala Val Ser Ala Leu Ser Lya Lys Ala Lya Ala Leu Leu 
\ 195 200 205 

TGC GAT GTG CCG TTT CTG TGT CAC TTC AGA AGA GCA CTA CAG CTT GTG 672 
40 Cys Asp Val Pxo Phe Leu Cys His Phe Arg Arg Ala Val Gin Leu Val 

210 215 220 


45 


50 


25 


30 


55 
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G&T AOS CAT CCA TAC GCG GAG ATC ACS AAC TTT CIA AAO ACC CAC ASA 720 
Asp Tbr His Pro Tyr Ala Olu He Thr Asn Phe Leu Lys Thr Bis Arg 
225 230 235 * 340 

QAC AAO OAA GAA ATC OTO TTC AGG ACT CTT TCC TAT TTC GAT GGA GTO 768 

Asp Lys Olu Glu He Val Phe Arg Thr Leu Ser Tyr Che Asp Gly Val 

245 250 355 

AAC TTC GCA OCC AGA OCG AAO ATC OCT CCG CPS TTT TCT GTG OCT CTC 016 
Asn Phe Ala Ala Arg Ala Lys He Pro Ala Leu Phe Ser Val Gly Leu 
260 365 370 


10 ATS GAC AAC ATT TGT CCT CCT TCA ACS CTT TTC GCT GCC TAC AAT TAC 

Met Asp Asn He Cys Pro Pro Ser Thr Val Phe Ala Ala Tyr Asn Tyr 
275 280 285 


864 


15 


20 


25 


30 


35 


40 


45 


50 


55 


TAC OCT GGA COS AAG GAA ATC AGA ATC TAT CCG TAC AAC AAC CAC GAG 864 
Tyr Ala Oly Pro Lys Glu He Arg He Tyr Pro Tyr Asn Asn Bis Glu 
290 395 300 

GGA GGA OGC TCT TTC CAA OCG GTT GAA CAG GTS AAA TTC TTG AAA AAA 913 
Gly Gly Gly Ser Phe Gin Ala Val Glu Gin val Lye Phe Leu Lys Lys 
305 310 315 330 

CTA TTT GAG AAA GGC TAA 930 
- Leu Phe Glu Lys Gly 
325 

(2) INFORMATION FOR SEQ ID NO:52: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 660 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
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TTG AAG TAC TTC AAA GCC CGO CTT CCC GCC ATC ACC TTG CTC GGC CTO 
Leu Lya Tyr Pbe Lys Ala Arg Lev Ala Oly lie Thr Leu Leu Gly Leu 
1 5 io 15 

CTO OCC TGC ACC TOO GCC TOO GOG CAG ACC GAG CCC ATC CTC TTC CTG 
Leu Ala Cys Thr Ser Ala Ser Ala Gin Thr Glu Pro lie Val Pbe Val 
20 25 30 


48 


9* 


CAC GGC TAT TCC GGC AGO GCA TCC AAC TGG GAC ACC ATG CTG GGC CGC 
Hie Oly Tyr Ser Gly Ser Ala Ser Aan Trp Asp Thr Met Leu Gly Arg 
35 40 45 


144 


TTC CGG TOG AAC GGT TAT GCG TCC GGC TCG CTC TAC ACC TTC AAC TAC 
Phe Arg Ser Asn Gly Tyr Ala Ser Gly Ser Leu Tyr Thr Pbe Aan Tyr 
50 55 60 


192 


AAC TOG TTG GTC ACC AGC AAC CGC ACC AGC GCC AGC GAG CTG CGC AOC 
Asn Sex Leu Val Ser Ser Asn Arg Thr Ser Ala Ser Glu Leu Arg Ser 
65 10 75 eo 


240 


TTC GTC AAC ACC GTG CGT TCG CGC CAC CCC AAC GCC CGC ATC GCG CTG 

Phe val Asn Thr Val Arg Ser Arg His Gly Asn Ala Arg lie Ala Leu 

65 90 9S 


208 


GTC GCC CAC TCC AAC GGC GGG CTG CTG TCG CGC TGG TAT CGC GCG GAG 
Val Ale His Ser Asn Gly Gly Leu Val Ser Arg Trp Tyr Arg Ala Glu 
100 105 HO 


336 


CTO GGC GGC OAA ACQ GCC ACC CGC CGC TTC GTG ACG CTG GGC ACG COG 
Leu Gly Gly Glu Thr Ala Thr Arg Arg Phe Val Thr Leu Gly Thr Pro 
115 120 125 


384 


CAC CGG GGC ACC ACC TGG GCC TAT* GCG TGC TAC AGC CCC GCA TGT TTC 
His Arg Oly Thr Thr Trp Ala Tyr Ala cys Tyr Ser Pro Ala Cys Phe 
130 135 140 


432 


GAG ATG CGC CCC GGC TCC AGC TTG CTG ACC ACQ CTG GGC TCG CGT GCC 480 
Glu Wet Arg Pro Oly Ser Ser Leu Leu Thr Thr Leu Gly Ser Arg Ala 
145 150 155 160 

TGC GAC CGC TCG CTG TGG TCG AAC ACC GAC GGC ATC ATC CTG COG GCG 528 
Cys Asp Arg Ser Leu Trp Ser Asn Thr Asp Oly lie lie Leu Pro Ala 
165 170 175 

TCC AGC GCC CAG TOT GGT GTC AGC ACG CGC ACT GCC GAC GTC AGC CAT 576 

Ser Ser Ala Gin Cys Gly Val ser Thr Arg Thr Ala Asp val ser Bis 

180 185 190 

CTC GAC CTG CTG ACC GAC TCT CGC GTG TAC ACQ CAG TTG CGC ACQ CAG 624 
Leu Asp Leu Leu Thr Asp Ser Arg Val Tyr Thr Gin Leu Arg Thr Gin 

195 200 205 

TTG GAA TGA GGG TQA OGG TGC ACC GAA CGT GCA OCT G 661 
Leu Gin Bnd Gly Bnd Arg Cys Thr Glu Arg Ala Pro 
210 215 220 


(2) INFORMATION FOR SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 201 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 
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Met Lys Val Lys Hia Val He Val Leu Bis Gly Leu Tyr Met Ser Gly 
15 10 15 

Leu Val Met Arg Pro Leu Cys Ser Arg Leu Glu Glu Ser Gly Val Lys 
20 25 30 

5 

Val Leu Aan Leu Thr Tyr Asn Thr Arg Asp pro Asn Arg Asp Ala Zle 
35 40 4$ 

Phe Thr Gin lie Asp Qlu Pbe He Ser Asn Glu Pro Ser Ala Leu Val 
50 5S GO 

70 Cys His Ser Mat Gly Gly Leu Val Ala Arg Ala Tyr Leu Glu Ala Asn 

65 70 75 60 

Ser Ala Pro Ser Hia Hia val Glu Lya Val He Thr Leu Oly Thr pro 
05 90 95 


15 


20 


25 


30 


His Thr Gly Ser Kis lie Ala Glu Lys Met Gin Gin Lys Oly Pbe Glu 
100 105 110 

Leu Leu Leu Lys Asn Ser Val Glu Phe Leu Leu Ser Lys Asn Gly Asp 
115 120 125 

Trp Pro Pbe Lys Ala Lys Leu Tyr Ser He Ala Gly Asp Leu Pro He 
130 135 140 

Gly Leu Met Pro Leu He Val Lys Gly Ser Arg Ser Asp Gly Thr Val 
145 150 155 ICO 

Leu Leu Asp Glu Thr Lys Leu Lys Gly Met Ala Glu His Lys Val Phe 

165 170 175 

His Leu Ser Bis Thr Ser Met He Tyr Ser Arg Gin Val Val Asn Tyr 
180 IBS 190 

He Leu Glu Arg Leu Asn Glu Asp He 
195 200 


(2) INFORMATION FOR SEQ ID NO:54 : 
(i) SEQUENCE CHARACTERISTICS 


(A) LENGTH: 259 AMINO ACIDS 

(B) TYPE: AMINO ACID 
35 (D) TOPOLOGY: LINEAR 


(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 


40 


45 


50 


55 
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Mat He Lys Asn Phe Asp Arg Glu Asn Ser Ser Leu Val Leu Ser Oly 
1 5 io XS 

Oly Oly Ala Leu Gly lie Ala His Leu oly val Leu His Asp Leu olu 

20 .35 .30- 

Lya Ola Aon lie val Pro Asn Olu He Val Gly Thr Ser Met Gly Gly 
35 «0 45 

He He Oly Ala ser Met Ala He Oly Met Lys Glu Lys Olu He Leu 
50 5S CO 

Glu Glu He Lys Asn Phe Ser Asn Val Phe Asn Trp He Lys Phe Ser 
«5 70 75 eo 

Phe Ser Gly Asn Ser Val val Asp Asn Glu Lys He Ala Lys lie Phe 
M SO 95 

Asp Thr Leu Phe Lys Asp Arg Lys Met Thr Asp Thr val He Pro Leu 
ioo ios no 

Lys Leu He Ala Thr Asn Leu His Asn Oly His Lys Lys Val Phe Thr 
US 120 135 

Ala Ser Asp Asp Val Leu He Lys Asp Ala He Leu Ser Thr Net Ala 
130 135 140 

He Pro Oly Val Phe Olu Glu Hla He He Asp Oly Olu Thr Tvr Olv 
US 150 1S5 l«o 

Asp Gly Phe Leu Cys Glu Asn Leu oly Val Asn Olu Ala Thr phe Asn 

1«S 170 175 

Asp Val Leu Ala Val Aap Val Met oly Olu Asn Ser Phe Olu Lys Ala 
lap ios 190 

Met Pro Adp Asn Phe Phe Lye Thr Ser Asn Val Leu Glu Mat Phe Glu 
19S 200 205 

Lys Ser Met Arg Leu Phe He Tyr Asn Oln Thr Gin Thr His He Lys 
210 215 220 

Asn Ala Aim Lys Asn He Tyr Leu Ho Glu Pro Val Thr Lys Glu Tyr 
235 230 235 240 

Lys Thr Phe Gin Phe His Lys His Lys Glu He Arg Ala Leu Gly Leu 
24S 250 255 

Gly Leu Leu 


(2) INFORMATION FOR SEQ ID NO:55: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 301 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: 


Mec Pro Leu His pro Lye Val Lys Lys Leu Leu Ser Gin Leu Pro Pro 
i 5 10 is 

Gin Asp Phe Ser Arg Asn Val Gin Asp Leu Arg Lys Ala Trp Asp Leu 
20 2S 30 

Pro Phe Ser Gly Arg Arg Glu Thr Leu Lys Arg Val Glu Asp Leu Olu 
35 40 45 

He Pro Thr Arg Asp Ala Arg He Arg Ala Arg Val Tyr Thr Pro Ser 
SO 55 GO 
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10 


Ser Lys Glu Asn Lea Pro Val Leu Val Tyr Tyr His Cly Oly Gly Phe 
« 70 75 ao 

val Phe Oly Ser Vsl Asp Ser Tyr Asp Oly Leu Als Ser Leu He Als 
85 90 95 

Lys Olu Ser Gly He Ala Val He Ser Val Glu Tyr at? Leu Ala Pro 
100 105 HO 

Olu His Lys Phe Pro Thr Ala Val Asn Asp Ser Trp Asp Ala Leu Leu 

115 130 125 

Trp He Ala Olu Asn Oly Gly Lys Leu Oly Leu Asp Thr Ser Ara Leu 
X30 135 140 

Ala val Ala Oly Asp ser AXa Gly Oly Asn Leu Ser Ala val Val ser 
1« ISO 155 150 

Leu Leu Asp Arg Asp Gin Gly Lyo oly Leu Val Ser Tyr Oln Val Leu 
1 $ 1«5 170 175 

He Tyr Pro Ala VaX Asn Met Val Asp Asn Ser Pro ser Val Ara Glu 
100 185 1*0 

Tyr Oly Olu Oly Tyr Phe Leu Thr Arg Ser Met Net Asn Trp Phe Oly 
195 200 205 

Thr Met Tyr Phe Ser ser Oly Arg Glu Ala Val Ser Pro Tyr Ala Ser 
310 315 230 

Pro Ala Leu Ala Asp Leu Bis Asn Leu Pro Pro Ser Leu val lis Thr 
235 230 235 240 

Ala olu Tyr Asp Pro Leu Arg Asp Oln Gly Glu Tnr Tyr Ser Bis Ser 
250 255 360 

Leu Asn Glu Ala Gly Asn Val Ser Thr Leu Val Arg Tyr Oln oly Met 
2<5 370 275 

He His Oly Phe Leu ser Phe Tyr Glu Trp He Thr Ala Oly Lys Leu 
280 2B5 290 

Ala He His Mis He Ala oly Val Leu Arg Ser Val Leu 
295 300 305 


20 


25 


30 


35 


(2) INFORMATION FOR SEQ ID NO:56: 
(i) SEQUENCE CHARACTERISTICS 


(A) LENGTH: 326 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 

45 


50 


55 
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w 


15 


Val Ala Phe Phe Asp Mat Pro Leu Olu Glu Leu Lys Lys Tyr Arg Pro 
13 10 15 

Olu Arg Tyr Clu Olu Lys Asp Phe Asp Glu Phe Trp Arg Olu Thr Leu 
ao is 30 

Lys Glu Ser Glu Gly Phe Pro Leu Asp Pro Val Phe Glu Lys Val Asp 
35 40 45 

Phe Bis Leu Lys Thr Val Glu Thr Tyr Asp Val Thr Phe Ser Gly Tyr 
SO 55 €0 

Arg Gly Gin Arg lie Lys Gly Trp Leu Leu Val Pro Lys Leu Ala Glu 
«5 70 75 80 

Glu Lys Leu Pro Cys Val Val Gin Tyr He Gly Tyr Asn Gly Gly Arg 
05 90 95 

Gly Phe Pro His Asp Trp Leu Phe Trp Pro Ser Met Gly Tyr lie Cys 
100 105 no 


Phe" Val Met Asp Thr Arg Gly Gin Gly Ser Gly Trp Met Lys Gly Asp 
20 115 * 120 125 

Thr Pro Asp Tyr Pro Glu Gly Pro Val Asp Pro Gin Tyr Pro Oly Phe 
130 135 140 

Met Thr Arg Gly He Leu Asp Pro Gly Thr Tyr Tyr Tyr Arg Arg val 
145 150 155 160 

Phe Val Asp Ala Val Arg Ala Val Glu Ala Ala lie Ser Phe Pro Arg 
16S 170 175 

val Asp Ser Arg Lys Val Val val Ala Gly Gly Ser Gin Gly Gly Gly 
180 105 190 

lie Pro Leu Ala val ser Ala Leu Ser Asn Arg Val Lys Ala Leu Leu 
195 200 205 

Cys Asp Val Pro Phe Leu Cys His Phe Arg Arg Ala Val Gin Leu Val 
210 215 220 

Asp Thr His Pro Tyr Val Glu He Thr Asn Phe Leu Lys Thr His Arg 
MS 230 235 240 

Asp Lys Glu Glu He Val Phe Arg Thr Leu Ser Tyr Phe Asp Oly Val 
245 250 255 

Asn Phe Ala Ala Arg Ala Lys Val Pro Ala Leu Phe Ser Val Gly Leu 
260 265 270 

Met Asp Thr He Cys Pro pro Ser Thr Val Phe Ala Ala Tyr Asn His 
275 2B0 285 

Tyr Ala Gly Pro Lys Glu He Arg He Tyr Pro Tyr Asn Asn His Glu 
290 295 300 

Oly Gly Gly Ser Phe Gin Ala He Glu Gin Val Lys Phe Leu Lys Arg 
305 310 3is 320 

Leu Phe Glu Glu Gly 
325 


25 


30 


35 


40 


45 


50 


{2} INFORMATION FOR SEQ ID NO:57: 
(i) SEQUENCE CHARACTERISTICS 


(A) LENGTH: 298 AMINO ACIDS 

(B) TYPE: AMINO ACID 
55 (D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 


Wet Arg Thr Leu Ser Phe Gly Pro Met Thr Thr Gly Gly 8er Xle Hie 
15 10 15 

Met Ala Thr Met Asp VaX Hot Arg Gly Pro Gly Met Gin Arg Leu Ser 
20 25 30 

Gin Gly Ala Arg Glu Ala Ala Asn His Pro Trp Ala Lys Arg Leu Gly 
35 40 45 

Arg Het Gly Tyr Ala Ala Lys Gly Ala Val Tyr Ala lie lie Gly Val 
50 55 60 

Leu Ala Leu Lys Leu Ala Ala Gly Glu Gly Gly Arg Thr Thr Asp Ser 
65 70 75 80 

Hxs Gly Ala Val Asn Thr Val Ala Mia Gly Pro Phe Gly Val Ala Leu 
85 90 95 

Leu Ala Val Leu Val Val Gly Leu Leu Gly Tyr Val Val Trp Arg Phe 
100 105 110 

Ala Gin Ala Phe Val Asp Thr Glu Asp Lys Gly Ser Asp Ala Lys Gly 
1X5 X20 X25 


lie Ala Thr Arg Ala net Tyr Phe l*u Ser Gly cys Xle Tyr Ala Ser 
130 135 140 

Leu Ala Phe Phe Ala Ala Gin ser Leu Val Gly Ala Ala Hie Gly Arg 
145 150 155 160 

Ser Lys Gly Thr Gin Gly Trp Thr Ala Thr Leu Met Glu Gin Pro Phe 
165 170 175 

Oly Arg Val Leu Val Ala Leu Val Gly Leu Gly lie Val Gly Phe Ala 
180 185 190 

Lfcu Lys Gin Phe Bis Thr Ala Trp Lys Ala Lys Phe Arg Glu Lys Leu 
195 200 205 

Thr Leu Thr Gly Leu Ala Ala Arg Lys Gin His Bis Zle Glu Arg Het 
aiO 215 220' 

Cys Gin Phe Gly Xle Ala Ala Arg Gly Val Val Phe Ala Val lie Gly 

225 230 23S 240 

Gly Phe Leu Val Arg Ser Ala Val Asp Ala Asn Pro Gly Glu Ala Lys 
245 250 255 

Gly Leu Gly Glu Ala Leu Ala Val VaX Ala Arg Gin Pro Ser Gly Asp 
260 26S 270 

Val Leu Leu Gly Val Val Ala Ala Gly Leu Val Ala Tyr Ala Ala Tyr 
275 280 205 

Leu Phe Leu Gin Ala Arg Tyr Arg Glu Leu 
290 295 


(2) INFORMATION FOR SEQ ID NO:58: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 304 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 
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Met Ser Lys Phe Ala He Leu Trp Ala Leu Zle Thr Ala Tyr Leu pro 
15 10 is 

Olu Pro Val Met Lys Leu Val Tyr Leu Gly Arg Arg Glu Thr Leu Oly 
20 25 30 

Ala Arg Thr Leu Asp Val Lys Ala Cln Ala Val Oly Arg Leu Ala Asa 
35 40 45 

Ala Thr Arg Pro Val Qiy val zle Pro Thr Val Olu Glu Ser Arg bya 
50 55 €0 

Met Thr Asp Lye Ala Val Ser Leu Phe Asp cln Pro Ala Pro Olu Leu 
65 70 75 00 

Die Arg Lys Lys Asp He Gin lie Asp Gly Ala Glu Gly Pro Zle Asp 
85 90 95 

Ala Arg Zle Tyr Ser Gly Pro Ala Lys His Arg Pro Arg Pro zle Leu 
100 105 110 

Val Tyr Phe His Gly Oly Gly Trp Val Gin Gly Asn Leu Asp Ser His 
115 120 125 

Asp Oly Val Cys Gly Lys Leu Ala Lys Trp Ala Asa Cys Zle val Xle 
130 135. 140 

Ser Val Asp Tyr Arg Leu Ala Pro Glu Bis Lys Phe Pro Cys Ala Pro 
145 150 * 155 150 

Leu Asp Ala Xle Ala Ala Tyr Lys Trp Val Arg Ala Aen Ala Thr Asn 

155 170 175 


Leu "Gly Gly Asp Pro Glu Arg He Gly Val Gly Gly Asp Ser Ala Gly 
180 185 190 

Gly Asa Leu Ala Ala Val Val Cys Gin Gin Thr Ala Met Asn Oly Glu 
195 200 205 

Arg Thr Pro Asp Leu Gin Val Leu He Tyr Pro Ala Leu Asp Ala Arg 
210 215 220 

Mat Zle Ser Thr Ser Met Glu Glu Leu Arg Asp Ala Tyr Xle Leu Pro 
225 230 235 240 

Lys Ser Arg Met Glu Tyr Phe Leu Gly Leu Tyr Thr Arg Oly Pro Asp 
245 250 255 

Asp lie Glu Asp Leu Arg Met Ser Pro Zle Leu Arg Asp Thr Val Ala 
260 265 270 

Asp Gin Pro Gin Ala Cys Zle Val Thr Cys Gly Phe Asp Pro Ala Arg 
275 280 285 

Arg Arg Glu Bis Leu Arg Arg Thr Leu Asn Cys Arg Gly Asp Arg Arg 
290 295 " 300 


(2) INFORMATION FOR SEQ ID NO:59: 


(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 308 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 
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Val Ser He Arg Lou Arg Leu Leu Asn Trp Phe Leu Aan ttir Phe Glu 
1 5 10 IS 

Ly0 Pro Lys Leu Ala Ala Ala Lye Thr pro Asp Asp Leu Arg Lye Ser 
20 25 30 

Phe Glu Leu Lya Ala Arg Phe Leu Phe Pro Ala Pro Arg Lye Thr Arg 
35 40 45 

Phe Ser Hie Asp Val Leu Gin Ser Gly lie Gly ser Val Abo Ala Gin 
50 55 60 

Trp Ala Lys Ser Lys Ser Ala Ser Asp Asp Arg Val Xle Leu Tyr Phe 
€5 70 75 00 

His Gly Gly Gly Tyr Val Phe Gly Ser Pro Lys Thr His Arg Ala Mec 
85 90 95 

Leu Ala Arg Leu Ser Ala Mec Thr Gly Leu Ser Ala Cys Leu Pro Asp 
100 105 1X0 

Tyr Arg Leu Ala Pro Glu His Pro Phe Pro Ala Ala He Glu Asp Ala 
115 120 125 

Val Leu Ser Tyr Lys Cys Leu Leu Glu Arg Ala He Glu Pro Gin Asn 
130 135 140 

20 lie He Leu Gly Gly Asp Ser Ala Gly Gly Gly Leu Val Leu Ala Leu 

145 ISO 155 160 

Leu Ala Glu He Lys Ala Gin Ser Leu Pro Lya Pro Ala Gly Val Phe 
165 170 175 


10 


15 


25 


30 


50 


Ala Leu Ser Pro Leu Val Asp Leu Ser Phe Ser Gly Leu Ser Phe Ser 
180 165 190 

Lys Asn Ala Gin Thr Asp Val Mec Leu Pro Ala Ser Arg Ala Ala Asp 
195 200 205 

Met Ala Thr Leu Tyr Leu Asp Gly Ala Asp Ala Asp Asp Pro Arg Ala 
210 215 220 


Ser Pro Leu Gin Ala Asp Phe Ser Gly Mec Pro Pro Val Phe Leu Thr 
225 230 235 240 

35 Ala Ser Asp Ser Glu He Leu Leu Aep Asp Cys Leu Arg Met Ala Asp 

245 250 255 

His Leu Arg Ala Gin Oly Val Val Val Thr Asp Arg He Val Glu Aan 
260 265 270 

Bis Pro Hie Val Trp His He Phe Gin Arg Leu Leu Pro Glu Ala Asp 
40 275 280 285 

Gin Gly Leu Arg Ala He Ala Ala Trp He Lys Pro Leu Leu Ser Gly 
290 295 300 

Ser Asn Glu Ser 
305 

45 


(2) INFORMATION FOR SEQ ID NO:60: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 237 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

55 (ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 
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10 


15 


20 


25 


30 


35 


45 


50 
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Met Leu Thr Phe Asn Val Leu Tyr Gly Met Met Lys Gin Lye Leu Ala 
1 S 10 'is 

Ala Xle Leu Met Phe Leu Oly Leu Ser Ala Ala Glu Ala Ola Asp Trp 
30 25 30 

Pro Asp Leu Gin Lys Tyr Arg Sex Ala Asn Lys Olu Ala Lys Leu Leu 
35 40 45 

Pro Lys Glu Asn Arg Lys val Val Phe Net Gly Ash Ser lie Thr Glu 
SO 55 60 

Ala Trp Xle Ser Gin Arg Pro Glu Phe Phe ser Glu Asn Gly Phe lie 
65 70 75 60 

Oly Arg Gly lie Ser Gly Gin Thr Thr Pro Gin Met Leu Leu Arg Phe 
05 90 95 

Arg Gin Asp Val lie Asp Leu Gin Pro Lys Ala Val Val Ila Leu Ala 
100 105 110 

Gly Thr Asn Asp val Ala Gin Asn Thr Oly Pro Met Thr lie Glu Olu 
115 120 125 

Ser Leu Ala Asn He Lys Ser Ket Val Olu Leu Ala Gin Ala Asn Gly 
130 135 140 

He Thr Pro Val Leu Cys Thr Val Leu Pro Ala Asp Arg Phe Ser Trp 

145 150 155 1(0 

Arg Pro Glu Leu Thr Pro Ala Glu Thr He He Ala Leu Asn Gin Leu 
165 170 175 

He Lys Gin Tyr Ala Olu Ala Gin Gly Leu Ala Leu Val Asp Tyr His 
180 185 190 

Ala Ala Leu Thr Asn Lys Gly Gly Gly Leu Pro Val Lys Tyr Gly Glu 
195 200 205 

Asp Gly Val His Pro Asn Val Ala Gly Tyr Gin val Met Glu Asn He 
210 215 220 

val Leu Pro Val He Ser Ser Glu Leu Ala Lye Leu Lys 
225 230 335 


(2) INFORMATION FOR SEQ ID HO: 61: 


(2) INFORMATION FOR SEQ ID NO:61: 

(i) SEQUENCE CHARACTERISTICS 

40 (A) LENGTH: 326 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 


(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: 


55 
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10 


Met Ala Pbe Phe Asp Leu Pro Leu OIu Glu Leu Lye Lye Tyr Arg Pro 
IS 10 15 

Glu Arg Tyr Olu Olu Lys Asp Pbe Asp Glu Phe Trp Olu Glu Thr Leu 
20 25 30 

AXa Olu Ser Olu Lys Phe Pro Leu Asp Pro Val Phe Olu Arg Met Glu 
35 40 45 

Ser Bis Leu Lya Thr Val Glu Ala Tyr Asp Val Thr Pbe Ser Cly Tyr 
SO 55 60 

Arg Oly Gin Arg He Lys Gly Trp Leu Leu Val Pro Lys Leu Glu Olu 
65 70 75 60 

Olu Lys Leu Pro Cys Val Val Gin Tyr lie Gly Tyr Asa Gly Gly Arg 
85 90 95 

Oly Phe Pro His Asp Trp Leu Phe Trp pro Ser Met Oly Tyr He Cys 
15 200 105 110 

Phe Val Met Asp Thr Arg Oly Olu Gly Ser Gly Trp Leu Lys Oly Asp 
115 120 125 

Thr Pro Asp Tyr Pro Glu Gly Pro Val Asp Pro Oln Tyr Pro Oly Phe 
130 135 140 

Met Thr Arg Oly He Leu Asp Pro Arg Thr Tyr Tyr Tyr Arg Arg Val 
145 150 155 ISO 

Phe Thr Asp Ala Val Arg Ala Val Glu Ala Ala Ala Ser Phe Pro Gin 
165 170 175 

Val Asp Oln Glu Arg He Val He Ala Gly Oly Ser Oln Oly Oly Gly 

25 180 185 190 

He Ala Leu Ala Val Ser Ala Leu Ser Lys Lys Ala Lys Ala Leu Leu 
195 200 205 


20 


30 


Cys Asp Val Pro Phe Leu cys His Phe Arg Arg Ala Val Oln Leu Val 
210 215 220 

Asp Thr His Pro Tyr Ala Glu He Thr Aso Phe Leu Lys Thr His Arg 
225 230 235 240 

Asp Lys Glu Olu He Val Phe Arg Thr Leu Ser Tyr Phe Asp Oly Val 
245 250 255 

Asn Phe Ala Ala Arg Ala Lys He Pro Ala Leu Phe Ser Val Gly Leu 
35 260 265 270 

Met Asp Aan He Cys Pro Pro Ser Thr Val Phe Ala Ala Tyr Asn Tyr 
275 280 285 

Tyr Ala Oly Pro Lys Glu He Arg He Tyr Pro Tyr Asn Asn His Olu 
290 295 300 


40 


45 


Gly Gly Oly Ser Phe Gin Ala Val Glu Oln Val Lya Phe Leu Lys Lys 
305 310 315 320 

Leu Phe Glu Lys Gly 
325 


(2) INFORMATION FOR SEQ ID NO:62: 
(i) SEQUENCE CHARACTERISTICS 
50 (A) LENGTH: 220 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

55 
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(ii) mxsaas tyrk: protein 

(xl) SBOUBSCE DESCRIPTION: SEO ID BO:€2: 

Leu Lys Tyr Phe Lys Ala Arg Leu Ala Oly lie Thr Leu Leu 01 y Leu 
5 1 5 10 15 

Leu Ala Cys Thr Ser Ala Ser Ala oin Thr Glu pro ZXe Val phe Val 
20 25 30 

His Oly Tyr Ser Oly Ser Ala Ser Asn Trp Asp Thr Met Leu Oly Arg 
35 40 * 45 

Phe Arg Ser Asn Cly Tyr Ala Ser Gly Ser Leu Tyr Thr Phe Asa Tyr 
50 55 60 

Aan Ser Leu Val Ser Ser Asa Arg Thr Ser Ala ser Glu Leu Arg Ser 

65 70 75 80 

15 Phe Val Asn Thr Val Arg Ser Arg Hie Oly Asn Ala Arg Xle Ala Leu 

B5 90 95 

Val Ala Mia Ser Aon Oly Cly Leu Val Ser Arg Trp Tyr Arg Ala Olu 
100 105 110 

Leu Cly Gly Glu Thr Ala Thr Arg Arg Phe Val Thr Leu Gly Thr Pro 
20 115 120 125 

His Arg Gly Thr Thr Trp Ala Tyr Ala Cys Tyr Ser Pro Ala cya Phe 
130 135 140 

Glu Met Arg Pro Gly Ser Ser Leu Leu Thr Thr Leu Gly Ser Arg Ala 
145 150 155 160 

25 Cys Asp Arg Ser Leu Trp Ser Aan Thr Asp Oly He lie Leu Pro Ala 

165 170 175 

Ser Ser Ala Gin Cys Gly Val Ser Thr Arg Thr Ala Asp Val Ser His 
180 185 ISO 

Leu Asp Leu Leu Thr Asp Ser Arg Val Tyr Thr Gin Leu Arg Thr Gin 
195 200 205 

Leu Gin find Gly End Arg Cys Thr Olu Arg Ala Pro G 
210 215 220 


30 


35 


SEQUENCE LISTING 
[0133] 

40 (1) GENERAL INFORMATION: 

(i) APPLICANT: Diversa Corporation 

(ii) TITLE OF INVENTION: ESTERASES 

45 

(iii) NUMBER OF SEQUENCES: 62 

(iv) CORRESPONDENCE ADDRESS: 

50 (A) ADDRESSEE: Morrison & Foerster LLP 

(B) STREET: 3811 Valley Centre Drive, Suite 500 

(C) CITY: San Diego 

(D) STATE: CA 

(E) COUNTRY: USA 
55 (F) ZIP: 92130 

(v) COMPUTER READABLE FORM: 
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(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 95 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US97/02039 

(B) FILING DATE: 11-FEB-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/602,359 

(B) FILING DATE: 16-FEB-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Einhorn, Ph.D., D.D.S., Gregory 

(B) REGISTRATION NUMBER: 38,440 

(C) REFERENCE/DOCKET NUMBER: 09010/010WO1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 858 720 5133 

(B) TELEFAX: 858 720 5125 

(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 


CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGTCTTTA AACAAGCACT CT 
(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 


CGGAAGATCT CTATCGTTTA GTGTATGATT T 
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(2) INFORMATION FOR SEQ ID NO:3: 
(i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 


15 CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAAACTC CTTGAGCCCA CA 52 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

20 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

30 

CGGAAGATCT CGCCGGTACA CCATCAGCCA C 31 

(2) INFORMATION FOR SEQ ID NO:5: 
35 (j) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
*o (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

45 

CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCCATAT GTTAGGAATG GT 52 


50 (2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 
55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

CGGAGGTACC TTAGAACTGT GCTGAAGAAA TAAATTCGTC CAT TGC TCTA TTA 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

CGGAGGTACC TTAGAACTGT GCTGAAGAAA TAAATTCGTC CATTGCTCT 
(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAGATTG AGGAAATTTG AAG 

(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDN A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

CGGAGGTACC CTATTCAGAA AGTACCTCTA A 

(2) INFORMATION FOR SEQ ID NO:10: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 


CCGAGAATTC A TT AAA GAG G AGAAATTAAC TATGTTTAAT ATCAATGTCT TT 

(2) INFORMATION FOR SEQ ID NO: 1 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: 


CGGAAGATCT TTAAGGATTT TCCCTGGGTA G 

(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 


CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGGAGGTT TACAAGGCCA AA 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
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CGGAGGTACC TTATTGAGCC GAAGAGTACG A 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGATTGGC AATTTGAAAT TGA 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

CGGAGGTACC TTAAAGTGCT CTCATATCCC C 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

CCGAGAATTC ATTAAAGAGG AGAAATTAAC T A TGCCAGC T AATGACTCAC CC 
(2) INFORMATION FOR SEQ ID NO:17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

CGGAAGATCT TCAACAGGCT CCAAATAATT TC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

CGGAAGATCT ACAGGCTCCA AATAATTTC 

(2) INFORMATION FOR SEQ ID NO:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 

CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCTTGAT ATGCCAATCG AC 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

CGGAGGTACC CTAGTCGAAC AGAAGAAGAG C 
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(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STR AN D E DN ESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 


CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCCCCTA GATCCTAGAA TT 


(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 


CGGAGGTACC TTAAATTTTA TCATAAAATA C 


(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...552 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
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ATG TCT TTA AAC AAG CAC TCT. TGG ATG GAT ATG ATA ATA TTT ATT CTC 48 
Met Ser Leu Asn Lys His Ser Trp Met Asp Met He He Phe lie Leu 
15 10 15 

5 

AGC TTT TCT TTC CCA TTA ACA ATG ATC GCA TTA GCT ATC TCT ATG TCG 96 
Ser Phe Ser Phe Pro Leu Thr Met He Ala Leu Ala He Ser Met Ser 
20 25 30 

TCA TGG TTT AAT ATA TGG AAT AAT GCA TTA AGC GAT CTA GGA CAT GCT 144 
10 Ser Trp Phe Asn He Trp Asn Asn Ala Leu Ser Asp Leu Gly His Ala 

35 40 45 

GTT AAA AGC AGT GTT GCT CCA ATA TTC AAT CTA GGT CTT GCA ATT GGT 192 
Val Lys Ser Ser Val Ala Pro He Phe Asn Leu Gly Leu Ala He Gly 
15 50 55 60 

GGG ATA CTA ATT GTT ATA GTT GGT TTA AGA AAT CTT TAT TCG TGG AGT 240 
Gly He Leu He Val He Val Gly Leu Arg Asn Leu Tyr Ser Trp Ser 
65 70 75 80 

20 


25 


30 


50 


AGA GTT AAA GGA TCT TTA ATC ATA TCC ATG GGT GTA TTT CTT AAC TTA 288 

Arg Val Lys Gly Ser Leu He lie Ser Met Gly Val Phe Leu Asn Leu 
85 90 95 

ATA GGG GTT TTC GAC GAA GTA TAT GGT TGG ATA CAT TTC CTA GTC TCA 336 

He Gly Val Phe Asp Glu Val Tyr Gly Trp lie His Phe Leu Val Ser 
100 105 110 

GTA TTG TTT TTC TTA TCA ATA ATA GCA TAT TTC ATA GCT ATA TCA ATA 384 

Val Leu Phe Phe Leu Ser He lie Ala Tyr Phe lie Ala lie Ser He 
115 120 125 


CTT GAC AAA TCA TGG ATA GCT GTT CTA CTA ATA ATA GGT CAT ATT GCA 432 
Leu Asp Lys Ser Trp lie Ala Val Leu Leu lie lie Gly His lie Ala 
35 130 135 140 

ATG TGG TAT CTA CAC TTT GCT TCA GAG ATT CCG AGA GGT GCG GCT ATT 480 

Met Trp Tyr Leu His Phe Ala Ser Glu lie Pro Arg Gly Ala Ala He 
145 150 155 160 

40 

CCC GAG TTA TTA GCG GTA TTC TCG TTT TTA CCA TTC TAT ATA AGA GAC 528 

Pro Glu Leu Leu Ala Val Phe Ser Phe Leu Pro Phe Tyr lie Arg Asp 
165 170 175 

TAT TTT AAA TCA TAC ACT AAA CGA TAG 555 
45 Tyr Phe Lys Ser Tyr Thr Lys Arg 
180 


(2) INFORMATION FOR SEQ ID NO:24: 
(i) SEQUENCE CHARACTERISTICS: 


(A) LENGTH: 1041 base pairs 

(B) TYPE: nucleic acid 

55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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10 


15 


20 


25 


35 


40 


45 


50 


55 


(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1... 1038 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 


ATG AAA CTC CTT GAG CCC ACA AAT ACC TCC TAC ACG CTG TTA CAG GAT 48 
Met Lys Leu Leu Glu Pro Thr Asn Thr Ser Tyr Thr Leu Leu Gin Asp 
15 10 15 

TTA GCA TTG CAT TTT GCA TTT TAC TGG TTT CTG GCC GTG TAT ACG TGG 96 
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Leu Ala Leu His Phe Ala Phe Tyr Trp Phe Leu Ala Val Tyr Thr Trp 
20 25 30 


TTA CCC GGT GTC CTA GTC CGG GGC GTA GCT GTG GAC ACA GGG GTG GCT 
Leu Pro Gly Val Leu Val Arg Gly Val Ala Val Asp Thr Gly Val Ala 
35 40 45 


144 


10 


CGG GTG CCT GGG CTC GGC CGG CGC GGT AAG AGG CTG CTC CTG GCC GCT 
Arg Val Pro Gly Leu Gly Arg Arg Gly Lys Arg Leu Leu Leu Ala Ala 
50 55 60 


192 


15 


GTG GCT GTC TTG GCG CTT GTT GTG TCC GTT GTT GTC CCG GCT TAT GTG 
Val Ala Val Leu Ala Leu Val Val Ser Val Val Val Pro Ala Tyr Val 
65 70 75 80 


240 


20 


GCG TAT AGT AGT CTG CAC CCG GAG AGC TGT CGG CCC GTT GCG CCG GAG 288 
Ala Tyr Ser Ser Leu His Pro Glu Ser Cys Arg Pro Val Ala Pro Glu 
85 90 95 

GGG CTC ACC TAC AAA GAG TTC AGC GTG ACC GCG GAG GAT GGC TTG GTG 336 
Gly Leu Thr Tyr Lys Glu Phe Ser Val Thr Ala Glu Asp Gly Leu Val 
100 105 110 


25 


GTT CGG GGC TGG GTG CTG GGC CCC GGC GCT GGG GGC AAC CCG GTG TTC 
Val Arg Gly Trp Val Leu Gly Pro Gly Ala Gly Gly Asn Pro Val Phe 
115 120 125 


384 


30 


GTT TTG ATG CAC GGG TAT ACT GGG TGC CGC TCG GCG CCC TAC ATG GCT 
Val Leu Met His Gly Tyr Thr Gly Cys Arg Ser Ala Pro Tyr Met Ala 
130 135 140 


432 


35 


GTG CTG GCC CGG GAG CTC GTG GAG TGG GGG TAC CCG GTG GTT GTG TTC 
Val Leu Ala Arg Glu Leu Val Glu Trp Gly Tyr Pro Val Val Val Phe 
145 150 155 160 

GAC TTC CGG GGC CAC GGG GAG AGC GGG GGC TCG ACG ACG ATT GGG CCC 
Asp Phe Arg Gly His Gly Glu Ser Gly Gly Ser Thr Thr lie Gly Pro 
165 170 175 


480 


528 


CGG GAG GTG CTG GAT GCC CGG GCT GTG GTG GGC TAT GTC TCG GAG CGG 
40 Arg Glu Val Leu Asp Ala Arg Ala Val Val Gly Tyr Val Ser Glu Arg 
180 185 190 


57 6 


45 


TTC CCC GGC CGC CGG ATA ATA TTG GTG GGG TTC AGT ATG GGC GGC GCT 624 
Phe Pro Gly Arg Arg He He Leu Val Gly Phe Ser Met Gly Gly Ala 
195 200 205 

GTA GCG ATC GTG GAG GGT GCT GGG GAC CCG CGG GTC TAC GCG GTG GCT 672 
Val Ala He Val Glu Gly Ala Gly Asp Pro Arg Val Tyr Ala Val Ala 
210 215 220 


50 


GCT GAT AGC CCG TAC TAT AGG CTC CGG GAC GTC ATA CCC CGG TGG CTG 
Ala Asp Ser Pro Tyr Tyr Arg Leu Arg Asp Val He Pro Arg Trp Leu 
225 230 235 240 


720 


55 
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GAG TAC AAG ACG CCG CTG CCG GGC TGG GTG GGT GTG CTG GCC GGG TTC 
Glu Tyr Lys Thr Pro Leu Pro Gly Trp Val Gly Val Leu Ala Gly- Phe 
245 250 255 

TAC GGG AGG CTG ATG GCG GGC GTT GAC CTC GGC TTC GGC CCC GCT GGG 
Tyr Gly Arg Leu Met Ala Gly Val Asp Leu Gly Phe Gly Pro Ala Gly 
260 265 270 

GTG GAG CGC GTG GAT AAG CCG TTG CTG GTG GTG TAT GGG CCC CGG GAC 
Val Glu Arg Val Asp Lys Pro Leu Leu Val Val Tyr Gly Pro Arg Asp 
275 280 285 

CCG CTG GTG ACG CGG GAC GAG GCG AGG AGC CTG GCG TCC CGT AGC CCG 
Pro Leu Val Thr Arg Asp Glu Ala Arg Ser Leu Ala Ser Arg Ser Pro 
290 295 300 

TGT GGC CGT CTC GTC GAG GTT CCT GGG GCT GGC CAC GTG GAG GCC GTG 
Cys Gly Arg Leu Val Glu Val Pro Gly Ala Gly His Val Glu Ala Val 
305 310 315 320 

GAT GTG CTC GGG CCG GGC CGC TAC GCA GAC ATG CTG ATA GAG CTG GCG 
Asp Val Leu Gly Pro Gly Arg Tyr Ala Asp Met Leu lie Glu Leu Ala 
325 330 335 


CAC GAG GAG TGC CCT CCG GGG GCC GGT GGC TGA 
His Glu Glu Cys Pro Pro Gly Ala Gly Gly 
340 345 


(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1... 786 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 


ATG CCA TAT GTT 
Met Pro Tyr Val 
1 

GAT GGA CCT GAG 
Asp Gly Pro Glu 
20 

ATG AAT TTT TGG 
Met Asn Phe Trp 


AGG AAT GGT GGT 
Arg Asn Gly Gly 
5 

CCA CCA ATT GTC 
Pro Pro He Val 


AAA GAG CAA AGA 
Lys Glu Gin Arg 


GTA AAT ATC TAT 
Val Asn He Tyr 
10 

TTT GTT CAC GGA 
Phe Val His Gly 
25 

CGT TAT TTT GCA 
Arg Tyr Phe Ala 


TAT GAA CTG GTG 
Tyr Glu Leu Val 
15 

TGG ACA GCA AAT 
Trp Thr Ala Asn 
30 

GGC AGG AAT ATG 
Gly Arg Asn Met 
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35 40 45 

ATG TTG TTT GTC GAT AAC AGA GGT CAT GGC AGG TCC GAT AAG CCA CTT 192 

Met Leu Phe Val Asp Asn Arg Gly His Gly Arg Ser Asp Lys Pro Leu 

50 55 60 

GGA TAC GAT TTC TAC AGA TTT GAG AAC TTC ATT TCA GAT TTA GAT GCG 240 

Gly Tyr Asp Phe Tyr Arg Phe Glu Asn Phe lie Ser Asp Leu Asp Ala 

65 70 75 80 

GTT GTT AGG GAG ACT GGA GTG GAG AAA TTT GTT CTC GTC GGA CAT TCA 288 

Val Val Arg Glu Thr Gly Val Glu Lys Phe Val Leu Val Gly His Ser 

85 90 95 

15 TTC GGA ACA ATG ATC TCT ATG AAG TAC TGT TCG GAG TAT CGG AAT CGG 336 

Phe Gly Thr Met He Ser Met Lys Tyr Cys Ser Glu Tyr Arg Asn Arg 

100 105 110 


10 


20 


25 


30 


35 


40 


45 


GTT CTT GCT CTA ATC CTC ATA GGT GGT GGG AGC AGA ATA AAG CTT CTA 384 

Val Leu Ala Leu He Leu He Gly Gly Gly Ser Arg He Lys Leu Leu 
115 120 125 

CAC AGA ATT GGA TAT CCT TTA GCA AAG ATT CTT GCA TCC ATT GCA TAC 432 

His Arg He Gly Tyr Pro Leu Ala Lys He Leu Ala Ser He Ala Tyr 
130 135 140 

AAG AAG TCT TCA AGA TTG GTC GCA GAT CTT TCC TTT GGC AAA AAT GCT 480 

Lys Lys Ser Ser Arg Leu Val Ala Asp Leu Ser Phe Gly Lys Asn Ala 

145 150 155 160 

GGT GAA CTT AAA GAG TGG GGA TGG AAA CAG GCA ATG GAT TAT ACA CCC 528 

Gly Glu Leu Lys Glu Trp Gly Trp Lys Gin Ala Met Asp Tyr Thr Pro 
165 170 175 

TCC TAC GTG GCA ATG TAC ACG TAC AGA ACT CTA ACG AAA GTG AAT CTT 576 

Ser Tyr Val Ala Met Tyr Thr Tyr Arg Thr Leu Thr Lys Val Asn Leu 
180 185 190 

GAA AAT ATC TTG GAG AAA ATA GAC TGT CCA ACA CTG ATT ATC GTT GGA 624 

Glu Asn He Leu Glu Lys He Asp Cys Pro Thr Leu He He Val Gly 
195 200 205 

GAA GAG GAT GCA CTA TTG CCC GTT AGC AAA TCA GTT GAG CTG AGC AGG 672 

Glu Glu Asp Ala Leu Leu Pro Val Ser Lys Ser Val Glu Leu Ser Arg 
210 215 220 

AGG ATA GAA AAC TCA AAG CTT GTG ATC ATC CCA AAC TCG GGG CAT TGC 720 

Arg He Glu Asn Ser Lys Leu Val He He Pro Asn Ser Gly His Cys 

225 1 230 235 240 

GTA ATG CTT GAG AGT CCA AGT GAG GTT AAT AGA GCA ATG GAC GAA TTC 768 

Val Met Leu Glu Ser Pro Ser Glu Val Asn Arg Ala Met Asp Glu Phe 
245 250 255 

ATT TCT TCA GCA CAG TTC TAA 789 


55 
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lie Ser Ser Ala Gin Phe 
260 


(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 756 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 


20 (A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...753 


10 


15 


25 


35 


45 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 


TTG AGA TTG AGG AAA TTT GAA GAG ATA AAC CTC GTT CTT TCG GGA GGA 48 

Leu Arg Leu Arg Lys Phe Glu Glu lie Asn Leu Val Leu Ser Gly Gly 
1 5 10 15 

30 GCT GCA AAG GGC ATA GCC CAC ATA GGT GTT TTG AAA GCT ATA AAC GAG 96 

Ala Ala Lys Gly lie Ala His lie Gly Val Leu Lys Ala He Asn Glu 
20 25 30 


CTC GGT ATA AGG GTG AGG GCT TTA AGC GGG GTG AGC GCC GGG GCA ATC 144 
Leu Gly lie Arg Val Arg Ala Leu Ser Gly Val Ser Ala Gly Ala He 
35 40 45 

GTT TCG GTC TTT TAT GCC TCA GGC TAC TCC CCT GAA GGG ATG TTC AGC 192 
Val Ser Val Phe Tyr Ala Ser Gly Tyr Ser Pro Glu Gly Met Phe Ser 
50 -55,- 60 

CTT CTG AAG AGG GTA AAC TGG CTG AAG CTG TTT AAG TTC AAG CCA CCT 240 
Leu Leu Lys Arg Val Asn Trp Leu Lys Leu Phe Lys Phe Lys Pro Pro 
65 70 75 80 

CTG AAG GGA TTG ATA GGG TGG GAG AAG GCT ATA AGA TTC CTT GAG GAA 288 
Leu Lys Gly Leu He Gly Trp Glu Lys Ala lie Arg Phe Leu Glu Glu 
85 90 95 


GTT CTC CCT TAC AGG AGA ATA GAA AAA CTT GAG ATA CCG ACG TAT ATA 336 

Val Leu Pro Tyr Arg Arg He Glu Lys Leu Glu He Pro Thr Tyr He 
50 100 105 110 

TGC GCG ACG GAT TTA TAC TCG GGA AGG GCT CTA TAC CTC TCG GAA GGG 384 

Cys Ala Thr Asp Leu Tyr Ser Gly Arg Ala Leu Tyr Leu Ser Glu Gly 
115 120 125 

55 

AGT TTA ATC CCC GCA CTT CTC GGC AGC TGT GCA ATT CCC GGC ATA TTT 432 

Ser Leu He Pro Ala Leu Leu Gly Ser Cys Ala He Pro Gly He Phe 
130 135 140 
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GAA CCC GTT GAG TAT AAG AAT TAC TTG CTC GTT GAC GGA GGT ATA GTT 
Glu Pro Val Glu Tyr Lys Asn Tyr Leu Leu Val Asp Gly Gly lie Val 
145 150 155 160 

AAC AAC CTT CCC GTT GAG CCC TTT CAG GAA AGC GGT ATT CCC ACC GTT 
Asn Asn Leu Pro Val Glu Pro Phe Gin Glu Ser Gly lie Pro Thr Val 
165 170 175 

TGC GTT GAT GTC CTT CCC ATA GAG CCG GAA AAG GAT ATA AAG AAC ATT 
Cys Val Asp Val Leu Pro lie Glu Pro Glu Lys Asp He Lys Asn He 
180 185 190 

CTT CAC ATC CTT TTG AGG AGC TTC TTT CTT GCG GTC CGC TCA AAC TCC 
Leu His He Leu Leu Arg Ser Phe Phe Leu Ala Val Arg Ser Asn Ser 
195 200 205 

GAA AAG AGA AAG GAG TTT TGT GAC CTC GTT ATA GTT CCT GAG CTT GAG 
Glu Lys Arg Lys Glu Phe Cys Asp Leu Val lie Val Pro Glu Leu Glu 
210 215 220 

GAG TTC ACA CCC CTT GAT GTT AGA AAA GCG GAC CAA ATA ATG GAG AGG 
Glu Phe Thr Pro Leu Asp Val Arg Lys Ala Asp Gin He Met Glu Arg 
225 230 235 240 

GGA TAC ATA AAG GCC TTA GAG GTA CTT TCT GAA TAG 
Gly Tyr He Lys Ala Leu Glu Val Leu Ser Glu 
245 250 


(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...891 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 


ATG TTT AAT ATC AAT GTC TTT GTT AAT ATA TCT TGG CTG TAT TTT TCA 
Met Phe Asn He Asn Val Phe Val Asn He Ser Trp Leu Tyr Phe Ser 
15 10 15 

GGG ATA GTT ATG AAG ACT GTG GAA GAG TAT GCG CTA CTT GAA ACA GGC 
Gly lie Val Met Lys Thr Val Glu Glu Tyr Ala Leu Leu Glu Thr Gly 
20 25 30 
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10 


15 


30 


50 


55 


GTA AGA GTG TTT TAT CGG TGT GTA ATC CCG GAG AAA GCT TTT AAC ACT 14 4 

Val Arg Val Phe Tyr Arg Cys Val lie Pro Glu Lys Ala Phe Asn Thr 
35 40 45 

TTG ATA ATA GGT TCA CAC GGA TTG GGG GCG CAC AGT GGA ATC TAC ATT 192 
Leu He lie Gly Ser His Gly Leu Gly Ala His Ser Gly He Tyr He 
50 55 60 

AGT GTT GCT GAA GAA TTT GCT AGG CAC GGA TTT GGA TTC TGC ATG CAC 240 
Ser Val Ala Glu Glu Phe Ala Arg His Gly Phe Gly Phe Cys Met His 
65 70 75 80 

GAT CAA AGG GGA CAT GGG AGA ACG GCA AGC GAT AGA GAA AGA GGG TAT 288 
Asp Gin Arg Gly His Gly Arg Thr Ala Ser Asp Arg Glu Arg Gly Tyr 
85 90 95 


GTG GAG GGC TTT CAC AAC TTC ATA GAG GAT ATG AAG GCC TTC TCC GAT 336 
Val Glu Gly Phe His Asn Phe He Glu Asp Met Lys Ala Phe Ser Asp 
20 100 105 110 

TAT GCC AAG TGG CGC GTG GGA GGT GAC GAA ATA ATA TTG CTA GGA CAC 384 

Tyr Ala Lys Trp Arg Val Gly Gly Asp Glu lie lie Leu Leu Gly His 
115 120 125 

25 

AGT ATG GGC GGG CTG ATA GCG CTC TTA ACA GTT GCA ACT TAT AAA GAA 432 

Ser Met Gly Gly Leu He Ala Leu Leu Thr Val Ala Thr Tyr Lys Glu 
130 135 140 

ATC GCC AAG GGA GTT ATC GCG CTA GCC CCG GCC CTC CAA ATC CCC TTA 4 80 

He Ala Lys Gly Val lie Ala Leu Ala Pro Ala Leu Gin lie Pro Leu 
145 150 155 160 

ACC CCG GCT AGA AGA CTT GTT CTA AGC CTC GCG TCA AGG CTT GCC CCG 528 
Thr Pro Ala Arg Arg Leu Val Leu Ser Leu Ala Ser Arg Leu Ala Pro 
35 165 170 175 

CAT TCT AAG ATC ACC TTA CAA AGG AGA TTG CCG CAG AAA CCA GAG GGT 576 

His Ser Lys He Thr Leu Gin Arg Arg Leu Pro Gin Lys Pro Glu Gly 
180 . 185 190 

40 

TTT CAA AGA GCA AAA GAT ATA GAA TAC AGT CTG AGT GAA ATA TCA GTC 624 

Phe Gin Arg Ala Lys Asp He Glu Tyr Ser Leu Ser Glu He Ser Val 
195 200 205 

AAG CTC GTG GAC GAA ATG ATT AAA GCA TCA TCT ATG TTC TGG ACC ATA 672 
45 Lys Leu Val Asp Glu Met lie Lys Ala Ser Ser Met Phe Trp Thr lie 

210 215 220 

GCA GGG GAA ATT AAT ACT CCC GTC CTG CTT ATT CAT GGG GAA AAA GAC 720 
Ala Gly Glu He Asn Thr Pro Val Leu Leu lie His Gly Glu Lys Asp 
225 230 235 240 


AAT GTC ATA CCT CCG GAG GCG AGC AAA AAA GCC TAC CAA TTA ATA CCT 768 
Asn Val lie Pro Pro Glu Ala Ser Lys Lys Ala Tyr Gin Leu lie Pro 
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245 250 255 

TCA TTC CCT AAA GAG TTG AAA ATA TAC CCC GAT CTT GGA CAC AAC TTG 816 
Ser Phe Pro Lys Glu Leu Lys lie Tyr Pro Asp Leu Gly His Asn Leu 
260 265 270 

TTT TTT GAA CCA GGC GCG GTG AAA ATC GTC ACA GAC ATT GTA GAG TGG 864 
Phe Phe Glu Pro Gly Ala Val Lys He Val Thr Asp He Val Glu Trp 
275 280 285 

GTT AAG AAT CTA CCC AGG GAA AAT CCT TAA 894 
Val Lys Asn Leu Pro Arg Glu Asn Pro 
290 295 


(2) INFORMATION FOR SEQ ID NO:28: 
(i) SEQUENCE CHARACTERISTICS: 

20 (A) LENGTH: 789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (H) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...786 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 


40 


45 


50 


55 


ATG 

GAG 

GTT 

TAC 

AAG 

GCC 

AAA 

TTC 

GGC GAA GCA 

AAG 

CTC GGC TGG GTC 

48 

Met 

Glu 

Val 

Tyr 

Lys 

Ala 

Lys 

Phe 

Gly Glu Ala 

Lys 

Leu Gly Trp Val 


1 




5 




10 


15 


GTT 

CTG 

GTT 

CAT 

GGC 

CTC 

GGC 

GAG 

CAC AGC GGA 

AGG 

TAT GGA AGA CTG 

96 

Val 

Leu 

Val 

His 

Gly 

Leu 

Gly 

Glu 

His Ser Gly Arg 

Tyr Gly Arg Leu 





20 





25 


30 


ATT 

AAG 

GAA 

CTC 

AAC 

TAT 

GCC 

GGC 

TTT GGA GTT 

TAC 

ACC TTC GAC TGG 

144 

He 

Lys 

Glu 

Leu 

Asn 

Tyr 

Ala 

Gly 

Phe Gly Val 

Tyr 

Thr Phe Asp Trp 




35 





40 



45 


CCC 

GGC 

CAC 

GGG 

AAG 

AGC 

CCG 

GGC 

AAG AGA GGG 

CAC 

ACG AGC GTC GAG 

192 

Pro 

Gly 

His 

Gly 

Lys 

Ser 

Pro 

Gly 

Lys Arg Gly His 

Thr Ser Val Glu 



50 





55 



60 



GAG 

GCG 

ATG 

GAA 

ATC 

ATC 

GAC 

TCG 

ATA ATC GAG 

GAG 

ATC AGG GAG AAG 

240 

Glu 

Ala 

Met 

Glu 

He 

He 

Asp 

Ser 

He He Glu 

Glu 

He Arg Glu Lys 


65 





70 



75 


80 


CCC 

TTC 

CTC 

TTC 

GGC 

CAC 

AGC 

CTC 

GGT GGT CTA 

ACT 

GTC ATC AGG TAC 

288 

Pro 

Phe 

Leu 

Phe 

Gly 

His 

Ser 

Leu 

Gly Gly Leu 

Thr 

Val He Arg Tyr 






85 




90 


95 


GCT 

GAG 

ACG 

CGG 

CCC 

GAT 

AAA 

ATA 

CGG GGA TTA 

ATA 

GCT TCC TCG CCT 

336 
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Ala Glu Thr Arg Pro Asp Lys lie Arg Gly Leu He Ala Ser Ser Pro 
100 105 110 

5 GCC CTC GCC AAG AGC CCG GAA ACG CCG GGC TTC ATG GTG GCC CTC GCG 384 

Ala Leu Ala Lys Ser Pro Glu Thr Pro Gly Phe Met Val Ala Leu Ala 
115 120 125 

AAG TTC CTT GGA AAG ATC GCC CCG GGA GTT GTT CTC TCC AAC GGC ATA 432 
w Lys Phe Leu Gly Lys He Ala Pro Gly Val Val Leu Ser Asn Gly He 

130 135 140 

AAG CCG GAA CTC CTC TCG AGG AAC AGG GAC GCC GTG AGG AGG TAG GTT 480 
Lys Pro Glu Leu Leu Ser Arg Asn Arg Asp Ala Val Arg Arg Tyr Val 
15 145 150 155 160 

GAA GAC CCA CTC GTC CAC GAC AGG ATT TCG GCC AAG CTG GGA AGG AGC 528 
Glu Asp Pro Leu Val His Asp Arg He Ser Ala Lys Leu Gly Arg Ser 
165 170 175 

ATC TTC GTG AAC ATG GAG CTG GCC CAC AGG GAG GCG GAC AAG ATA AAA 576 
He Phe Val Asn Met Glu Leu Ala His Arg Glu Ala Asp Lys He Lys 
180 185 .190 

GTC CCG ATC CTC CTT CTG ATC GGC ACT GGC GAT GTA ATA ACC CCG CCT 624 
Val Pro lie Leu Leu Leu lie Gly Thr Gly Asp Val He Thr Pro Pro 
195 200 205 

GAA GGC TCA CGC AGA CTC TTC GAG GAG CTG GCC GTC GAG AAC AAA ACC 672 

Glu Gly Ser Arg Arg Leu Phe Glu Glu Leu Ala Val Glu Asn Lys Thr 
210 215 220 

30 

CTG AGG GAG TTC GAG GGG GCG TAC CAC GAG ATA TTT GAA GAC CCC GAG 720 

Leu Arg Glu Phe Glu Gly Ala Tyr His Glu He Phe Glu Asp Pro Glu 
225 230 235 240 

35 TGG GCC GAG GAG TTC CAC GAA ACA ATT GTT AAG TGG CTG GTT GAA AAA 768 

Trp Ala Glu Glu Phe His Glu Thr lie Val Lys Trp Leu Val Glu Lys 
245 250 255 


20 


25 


40 


55 


TCG TAC TCT TCG GCT CAA TAA 789 
Ser Tyr Ser Ser Ala Gin 
260 


(2) INFORMATION FOR SEQ ID NO:29: 
45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
so (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 


(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...747 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 


5 


15 


20 


30 


35 


40 


45 


TTG 

ATT 

GGC 

AAT 

TTG 

AAA 

TTG 

AAG 

AGG 

TTT 

GAA 

GAG 

GTT 

AAC 

TTA 

GTT 

48 

Leu 

lie Gly 

Asn 

Leu 

Lys 

Leu 

Lys 

Arg 

Phe 

Glu 

Glu 

Val 

Asn 

Leu 

Val 


1 




5 





10 





15 



CTT 

TCG 

GGA 

GGG 

GCT 

GCC 

AAG 

GGT 

ATC 

GCC 

CAT 

ATA 

GGT 

GTT 

TTA 

AAA 

96 

Leu 

Ser Gly 

Gly 

Ala 

Ala 

Lys 

Gly 

He 

Ala 

His 

He 

Gly 

Val 

Leu 

LVS 





20 





25 





30 




GCT 

CTG 

GAA 

GAG 

CTC 

GGT 

ATA 

AAG 

GTA 

AAG 

AGG 

CTC 

AGC 

GGG 

GTA 

AGT 

144 

Ala 

Leu 

Glu 

Glu 

Leu 

Glv 
vaxy 

He 

Lys 

Val 

Lys Arg Leu 

Ser 

Glv 
uxy 

Val 

Ser 




35 





40 





45 





GCT 

GGA 

GCT 

ATC 

GTT 

TCC 

GTC 

TTT 

TAC 

GCT 

TCG 

GGC 

TAC 

ACT 

CCC 

GAC 

192 

Al A 

Gly Ala 

He 

Val 

Ser 

Val 

Phe 

i yx 

Ala 

Ser Gly 

Tvr 
lyt 

Thr 

C X<J 

Asp 



50 





55 





60 






GAG 

ATG 

TTA 

AAA 

CTC 

CTG 

AAA 

GAG 

GTA 

AAC 

TGG 

CTC 

AAA 

CTT 

TTT 

AAG 

240 


Met 

Leu 

Lys 

Leu 


Lys 

Glu 

Val 

Asn Trp Leu 

Lys 


Phe 



65 





70 





75 





80 


TTP 
I 1L 

AAA 

ACA 

CCG 

AAA 

ATG 

GGC 

TTA 

ATG 

GGG 

TGG 

GAG 

AAG 




lOO 

Phe 

Lys 

Thr 

Pro 

Lys 

Met 

Glv 

Leu 

Met 

Gly Trp Glu 

Lys 

Ala 

Ala 

Glu 






85 





90 





95 



TTT 

TTG 

GAA 

rtrvtt 




ui 1 

nttb 

AGG 

CTG 

GAA 

rar 


aar 

21T2V 

Job 


Leu 

Glu 

Lys 

O J. II 

Leu 

ui y 

VOX 


Arg 

Leu 

Glu 

Asp 

Leu 

Asn 

He 





100 





105 





110 





ACC 

TAT 



1 ljrV7 


Un A 


TAC 

ACG 

GGA 

aar; 



Tar 


Pro 

Thr 

Tyr 

Leu 

Cy s 

OCX. 


Asp 

Leu 

Tyr Thr Gly 

Lys 

Al ;» 
r\X ct 

Leu 

Tyr 




115 





120 





125 





TTC 

GGC 

AGA 

GGT 

GAC 

TTA 

ATT 

CCC 

GTG 

CTT 

CTC 

GGA 

AGT 

TGT 

TCC 

ATA 

432 


Gly Arg 

ui y 

Asp 

Leu 

Tip 

x x e 

Pro 

v ax 

Leu Leu Gly 


Cys 


T 1 n 

xxe 



130 





135 





140 






ccc 

GGG 

ATT 

TTT 

GAA 

CCA 

GTT 

GAG 

TAC 

GAG 

AAT 

TTT 

CTA 

CTT 

GTT 

GAC 

480 

Pro 

Gly 

He 

Phe 

Glu 

Pro 

Val 

Glu 

j. yx 

Glu 

Asn 

Phe 

Leu 

Leu 

VaI 

VOX 

Asp 


145 





150 





155 





160 


GGA 

GGT 

ATA 

GTG 

AAC 

AAC 

CTG 

CCC 

GTA 

GAA 

CCT 

TTG 

GAA 

AAG 

TTC 

AAA 

528 

Gly 

Gly He 

Val 

Asn 

Asn 

Leu 

Pro 

Val 

Glu 

Pro 

Leu 

Glu 

Lys 

Phe 

Lys 






165 





170 





175 



GAA 

CCC 

ATA 

ATC 

GGG 

GTA 

GAT 

GTG 

CTT 

CCC 

ATA 

ACT 

CAA 

GAA 

AGA 

AAG 

576 

Glu 

Pro 

He 

He 

Gly 

Val 

Asp 

Val 

Leu 

Pro 

He 

Thr 

Gin 

Glu 

Arg 

Lys 





180 





185 





190 




ATT 

AAA 

AAT 

ATA 

CTC 

CAC 

ATC 

CTT 

ATA 

AGG 

AGC 

TTC 

TTT 

CTG 

GCG 

GTT 

624 

He 

Lys Asn 

He 

Leu 

His 

He 

Leu 

He 

Arg 

Ser 

Phe 

Phe 

Leu 

Ala 

Val 



195 200 205 
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CGT TCC AAT TCG GAA AAG AGA AAG GAG TTC TGC AAC GTA GTT ATA GAA 
Arg Ser Asn Ser Glu Lys Arg Lys Glu Phe Cys Asn Val Val lie Glu 
210 215 220 

CCT CCC CTT GAA GAG TTC TCT CCT CTG GAC GTA AAT AAG GCG GAC GAG 
Pro Pro Leu Glu Glu Phe Ser Pro Leu Asp Val Asn Lys Ala Asp Glu 
225 230 235 240 

ATA TTC TGC GGG GAT ATG AGA GCA CTT TAA 
lie Phe Cys Gly Asp Met Arg Ala Leu 
245 


(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1014 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 


ATG CCA GCT AAT GAC TCA CCC ACG ATC GAC TTT AAT CCT CGC GGC ATT 
Met Pro Ala Asn Asp Ser Pro Thr lie Asp Phe- Asn Pro Arg Gly lie 
15 10 15 


CTT CGC AAC GCT CAC GCA CAG GTT ATT TTA GCG ACT TCC GGC TTG CGC 
Leu Arg Asn Ala His Ala Gin Val lie Leu Ala Thr Ser Gly Leu Arg 
20 25 30 

AAA GCG TTT TTG AAA CGC ACG CAC AAG AGC TAC CTC AGC ACT GCC CAA 
Lys Ala Phe Leu Lys Arg Thr His Lys Ser Tyr Leu Ser Thr Ala Gin 
35 40 45 

TGG CTG GAG CTC GAT GCC GGC AAC GGA GTT ACC TTG GCC GGA GAG CTT 
Trp Leu Glu Leu Asp Ala Gly Asn Gly Val Thr Leu Ala Gly Glu Leu 
50 55 60 

AAC ACA GCG CCT GCA ACT GCA TCC TCC TCC CAC CCG GCG CAC AAG AAC 
Asn Thr Ala Pro Ala Thr Ala Ser Ser Ser His Pro Ala His Lys Asn 
65 70 75 80 

ACT CTG GTT ATT GTG CTG CAC GGC TGG GAA GGC TCC AGC CAG TCG GCC 
Thr Leu Val lie Val Leu His Gly Trp Glu Gly Ser Ser Gin Ser Ala 
85 90 95 
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w 


15 


TAT GCG ACC TCC GCT GGC AGC ACG CTT TTC GAC AAT GGG TTC GAC ACT 336 
Tyr Ala Thr Ser Ala Gly Ser Thr Leu Phe Asp Asn Gly Phe Asp- Thr 
100 105 110 

TTT CGC CTT AAT TTT CGC GAT CAC GGC GAC ACC TAC CAC TTA AAC CGC 384 

Phe Arg Leu Asn Phe Arg Asp His Gly Asp Thr Tyr His Leu Asn Arg 
115 120 125 

GGC ATA TTT AAC TCA TCG CTG ATT GAC GAA GTA GTG GGC GCA GTC AAA 432 

Gly lie Phe Asn Ser Ser Leu lie Asp Glu Val Val Gly Ala Val Lys 
130 135 140 

GCC ATC CAG CAG CAA ACC GAC TAC GAC AAG TAT TGC CTG ATG GGG TTC 480 

Ala lie Gin Gin Gin Thr Asp Tyr Asp Lys Tyr Cys Leu Met Gly Phe 

145 150 155 160 

TCA CTG GGT GGG AAC TTT GCC TTG CGC GTC GCG GTG CGG GAA CAG CAT 528 

Ser Leu Gly Gly Asn Phe Ala Leu Arg Val Ala Val Arg Glu Gin His 
165 170 175 

20 CTC GCT AAA CCG CTA GCG GGC GTG CTC GCC GTA TGC CCG GTA CTC GAC 57 6 

Leu Ala Lys Pro Leu Ala Gly Val Leu Ala Val Cys Pro Val Leu Asp 
180 185 190 

CCC GCA CAC ACC ATG ATG GCC CTA AAC CGA GGT GCG TTT TTC TAC GGC 624 

25 Pro Ala His Thr Met Met Ala Leu Asn Arg Gly Ala Phe Phe Tyr Gly 

195 200 205 

CGC TAT TTT GCG CAT AAA TGG AAG CGC TCG TTA ACC GCA AAA CTT GCA 672 

Arg Tyr Phe Ala His Lys Trp Lys Arg Ser Leu Thr Ala Lys Leu Ala 
210 215 220 

30 

GCT TTC CCA GAC TAC AAA TAC GGC AAA GAT TTA AAA TCG ATA CAC ACG 720 

Ala Phe Pro Asp Tyr Lys Tyr Gly Lys Asp Leu Lys Ser lie His Thr 

225 230 235 240 

35 CTT GAT GAG TTA AAC AAC TAT TTC ATT CCC CGC TAC ACC GGC TTC AAC 768 

Leu Asp Glu Leu Asn Asn Tyr Phe lie Pro Arg Tyr Thr Gly Phe Asn 
245 250 255 

TCA GTC TCC GAA TAC TTC AAA AGT TAC ACG CTC ACC GGG CAG AAG CTC 816 

Ser Val Ser Glu Tyr Phe Lys Ser Tyr Thr Leu Thr Gly Gin Lys Leu 
260 265 270 

GCG TTT CTC AAC TGC CCC AGT TAC ATT CTG GCA GCT GGC GAC GAC CCA 864 

Ala Phe Leu Asn Cys Pro Ser Tyr lie Leu Ala Ala Gly Asp Asp Pro 
275 280 285 

ATA ATT CCA GCA TCC GAC TTT CAG AAA ATA GCC AAG CCT GCG AAT CTG 912 

lie lie Pro Ala Ser Asp Phe Gin Lys lie Ala Lys Pro Ala Asn Leu 
290 295 300 

CAC ATA ACA GTA ACG CAA CAA GGT TCT CAT TGC GCA TAC CTG GAA AAC 960 

His lie Thr Val Thr Gin Gin Gly Ser His Cys Ala Tyr Leu Glu Asn 

305 310 315 320 


40 


45 


50 


55 
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CTG CAT AAA CCT AGT GCT GCC GAC AAA TAT GCG GTG AAA TTA TTT GGA 1008 
Leu His Lys Pro Ser Ala Ala Asp Lys Tyr Ala Val Lys Leu Phe'Gly * 
5 325 " 330 335 

GCC TGT TGA 1017 
.Ala Cys 

(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

15 

(A) LENGTH: 936 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

25 (A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. ..933 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

30 


45 


55 


ATG 

CTT 

GAT 

ATG 

CCA 

ATC 

GAC 

CCT 

GTT TAC 

TAC 

CAG CTT GCT 

GAG 

TAT 

48 

Met 

Leu 

Asp 

Met 

Pro 

He 

Asp 

Pro 

Val Tyr 

Tyr Gin Leu Ala 

Glu 

Tyr 


1 




5 




10 



15 



TTC 

GAC 

AGT 

CTG 

CCG 

AAG 

TTC 

GAC 

CAG TTT 

TCC 

TCG GCC AGA 

GAG 

TAC 

96 

Phe 

Asp 

Ser 

Leu 

Pro 

Lys 

Phe 

Asp 

Gin Phe 

Ser Ser Ala Arg 

Glu 

Tyr 





20 





25 


30. 




AGG 

GAG 

GCG 

ATA 

AAT 

CGA 

ATA 

TAC 

GAG GAG 

AGA 

AAC CGG CAG 

CTG 

AGC 

144 

Arg 

Glu 

Ala 

He 

Asn 

Arg. 

He 

Tyr 

Glu Glu 

Arg Asn Arg Gin 

Leu 

Ser 




35 





40 



4 5 




CAG 

CAT 

GAG 

AGG 

GTT 

GAA 

AGA 

GTT 

GAG GAC 

AGG 

ACG ATT AAG 

GGG 

AGG 

192 

Gin 

HIS 

Glu 

Arg 

Val 

Glu 

Arg 

Val 

Glu Asp 

Arg 

Thr He Lys 

Gly 

Arg 



50 





55 




60 




AAC 

GGA 

GAC 

ATC 

AGA 

GTC 

AGA 

GTT 

TAC CAG 

CAG 

AAG CCC GAT 

TCC 

CCG 

. 240 

Asn 

Gly 

Asp 

He 

Arg 

Val 

Arg 

Val 

Tyr Gin 

Gin 

Lys Pro Asp 

Ser 

Pro 


65 





.70 




75 



80 


GGT 

CTG 

GTT 

TAC 

TAT 

CAC 

GGT 

GGT 

GGA TTT 

GTG 

ATT TGC AGC 

ATC 

GAG 

288 

Val 

Leu 

Val 

Tyr 

Tyr 

His 

Gly 

Gly 

Gly Phe 

Val 

He Cys Ser 

He 

Glu 






85 




90 



95 



TCG 

CAC 

GAC 

GCC 

TTA 

TGC 

AGG 

AGA AYY GCG 

AGA 

CTT TCA AAC 

TCT 

ACC 

■ 336 

Ser 

His 

Asp 

Ala 

Leu 

Cys 

Arg 

Arg 

He Ala 

Arg 

Leu Ser Asn 

Ser 

Thr 





100 





105 


1*10 




GTA 

GTC 

TCC 

GTG 

GAT 

TAC 

AGG 

CTC 

GCT CCT 

GAG 

CAC AAG TTT 

CCC 

CCC 

384 

Val 

Val 

Ser 

Val 

Asp 

Tyr 

Arg 

Leu 

Ala Pro 

Glu 

His Lys Phe 

Pro 

Ala 
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115 120 125 

CCA GTT TAT CAT TGC TAC GAT GCG ACC AAG TGG GTT GCT GAG AAC CGG 
Ala Val Tyr Asp Cys Tyr Asp Ala Thr Lys Trp VaJ. Ala Glu Asn Ala- 
130 135 140 " 

GAG GAG CTG AGG ATT GAC CCG TCA AAA ATC TTC GTT GGG GGG GAC AGT 
Glu Glu Leu Arg lie Asp Pro Ser Lys He Phe Val Gly Gly Asp Ser 
145 150 155 " 160 

GCG GGA CG AAT CTT GCC CGG GCG CTT TCA ATA ATG GCG AGA GAC AGC 
Ala Gly Gly Asn Leu Ala Ala Ala Val Ser He Met Ala Arg Asp Ser 
165 * 170 175 

GGA GAA GAT TTC ATA AAG CAT CAA ATT CTA ACT TAC CCC GTT GTG AAC 
Gly Glu Asp Phe He Lys His Gin He Leu He Tyr Pro Val Val Asn 
180 185 190 • 

TTT GTA GCC CCC ACA CCA TCG CTT CTG GAG TTT GGA GAG GGG CTG TGG 
Phe Val Ala Pro Thr Pro Ser Leu Leu" Glu. Phe Gly Glu Gly Leu Trp 
195 200 205 

ATT CTC GAC CAG AAG ATA ATG AGT TGG TTC TCG GAG CAG TAC TTC TCC 
He Leu Asp Gin Lys He Met Ser Trp Phe Ser Glu Gin Tyr Phe Ser 
210 215 220 

AGA. GAG GAA GAT AAG TTC AAG CCC CTC GCC TCC GTA ATC TTT GCG GAC 
Arg Glu Glu Asp Lys Phe Asn Pro Leu Ala Ser Val He Phe Ala Asp 
225 230 235 240 

CTT GAG AAC CTA CCT CCT GCG CTG ATC ATA ACC GCC GAA TAC GAC CCG 
Leu Glu Asn Leu Pro Pro Ala Leu He He Thr Ala Glu Tyr Asp Pro 
245 250 255 

CTG AGA GAT GAA GGA GAA GTT TTC GGG CAG ATG CTG AGA AGA GCC GGT 
Leu Arg Asp Glu Gly Glu Val Phe Gly Gin Met Leu Arg Arg Ala Gly 
260 265 270 

GTT GAG GCG AGC ATC GTC AGA TAC AGA GGG GTG CTT CAC GGA TTC ATC 
Val Glu Ala Ser He Val Arg Tyr Arg Gly Val Leu His Gly Phe He 
275 280 285 

AAT TAC TAT CCC GTG CTG AAG GCT GCG AGG GAT GCG ATA AAC CAG ATT 
Asn Tyr Tyr Pro Val Leu Lys Ala Ala Arg Asp Ala He Asn Gin He 
. 290 295 300 

GCC GCT CTT CTT GTG TTC GAC TAG 
Ala Ala Leu LeuJVal Phe Asp 
305 310 


(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...915 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 


ATG CCC CTA GAT CCT AGA ATT AAA AAG TTA CTA GAA TCA GCT CTT ACT 
Met Pro" Leu Asp Pro Arg lie Lys Lys Leu Leu Glu Ser Ala Leu Thr 
1 5 10 15 

ATA CCA ATT GGT AAA GCC CCA GTA GAA GAG GTA AGA AAG ATA TTT AGG 
lie Pro lie Gly Lys Ala Pro Val Glu Glu Val Arg Lys lie Phe Arg 

- - 20 25 30 

CAA TTA GCG TCG GCA GCT CCC AAA GTC GAA GTT GGA AAA GTA GAA GAT 
Gin Leu Ala Ser Ala Ala Pro Lys Val Glu Val Gly Lys Val Glu Asp' 
35 40 45 

ATA AAA ATA CCA GGC AGT GAA ACC GTT ATA AAC GCT AGA GTG TAT TTT 
lie Lys lie Pro Gly Ser Glu Thr Val lie Asn Ala Arg Val Tyr Phe 
50 55 60 

CCG AAG AGT AGC GGT CCT TAT GGT GTT CTA GTG TAT CTT CAT GGA GGC 
Pro Lys Ser Ser Gly Pro Tyr Gly Val Leu Val Tyr Leu His Gly Gly 
65 70 75 80 

GGT TTT GTA ATA GGC GAT GTG GAA TCT TAT GAC CCA TTA TGT AGA GCA 
Gly Phe Val lie Gly Asp Val Glu Ser Tyr Asp Pro Leu Cys Arg Ala 
85 90 95 

ATT ACA AAT GCG TGC AAT TGC GTT GTA GTA TCA GTG GAC TAT AGG TTA 
lie Thr Asn Ala Cys Asn Cys Val Val Val Ser Val Asp Tyr Arg Leu 
100 105 110 

GCT CCA GAA TAC AAG TTT CCT TCT GCA GTT ATC GAT TCA TTT GAC GCT 
Ala Pro Glu Tyr Lys Phe Pro Ser Ala Val He Asp Ser Phe Asp Ala 
115 120 125 

ACT AAT TGG GTT TAT AAC AAT TTA GAT AAA TTT GAT GGA AAG ATG GGA 
Thr Asn Trp Val Tyr Asn Asn Leu Asp Lys Phe Asp Gly Lys Met Gly 
130 135 140 

GTT GCG ATT GCG GGA GAT AGT GCT GGA GGA AAT TTG GCA GCG GTT GTA 
Val Ala He Ala Gly AsjJ' Ser Ala Gly Gly Asn Leu Ala Ala Val Val 
145 150 155 160 

GCT CTT CTT TCA AAG GGT AAA ATT AAT TTG AAG TAT CAA ATA CTG GTT 
Ala Leu Leu Ser Lys Gly Lys He Asn Leu Lys Tyr Gin He Leu Val 
165 170 175 


97 


EP 0 880 590 B1 


TAC CCA GCG GTA AGT TTA GAT AAC GTT TCA AGA TCC ATG ATA GAG TAC 
Tyr Pro Ala Val Ser Leu Asp Asn Val Ser Arg Ser Met lie Glu Tyr 
180 185 190 

TCT GAT GGG TTC TTC CTT ACC AGA GAG CAT ATA GAG TGG TTC GGT TCT 
Ser Asp Gly Phe Phe Leu Thr Arg Glu His He Glu Trp Phe Gly Ser 
195 200 205 

CAA TAC TTA CGA AGC CCT GCA GAT TTG CTA GAC TTT AGG TTC TCT CCA 
Gin Tyr Leu Arg Ser Pro Ala Asp Leu Leu Asp Phe Arg Phe Ser Pro 
210 215 220 

ATT CTG GCG CAA GAT TTC AAC GGA TTA CCT CCA GCC TTG ATA ATA ACA 
He Leu Ala Gin Asp Phe Asn Gly Leu Pro Pro Ala Leu He He Thr 
225 230 235 240 

GCA GAA TAC GAT CCA CTA AGG GAT CAA GGA GAA GCG TAT GCA AAT AAA 
Ala Glu Tyr Asp Pro Leu Arg Asp Gin Gly Glu Ala Tyr Ala Asn Lys 
245 250 255 

CTA CTA CAA GCT GGA GTC TCA GTT ACT AGT GTG AGA TTT AAC AAC GTT 
Leu Leu Gin Ala Gly Val Ser Val Thr Ser Val Arg Phe Asn Asn Val 
260 265 270 

ATA CAC GGA TTC CTC TCA TTC TTT CCG TTG ATG GAG CAA GGA AGA GAT 
He His Gly Phe Leu Ser Phe Phe Pro Leu Met Glu Gin Gly Arg Asp 
275 280 285 

GCT ATA GGT CTG ATA GGG TCT GTG TTA AGA CGA GTA TTT TAT GAT AAA 
Ala lie Gly Leu He Gly Ser Val Leu Arg Arg Val Phe Tyr Asp Lys 
290 295 300 

ATT TAA 

He 

305 


(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 


Met Ser Leu Asn Lys His Ser Trp 

1 5 
Ser Phe Ser Phe Pro Leu Thr Met 
20 

Ser Trp Phe Asn He Trp Asn Asn 
35 40 


Met Asp Met He He Phe He Leu 

10 15 
He-Ala Leu Ala He Ser Met Ser 

25 30 
Ala Leu Ser Asp Leu Gly His Ala 
45 
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5 


15 


val 

Lys 

Ser 

Ser 

Val 

Ala 

Pro 

He Phe 

Asn 

Leu Gly Leu Ala lie Gly 


50 





55 



60 


Gly 

He 

Leu 

He 

Val 

He 

Val 

Gly Leu 

Arg Asn Leu 

Tyr Ser Trp Ser 

65 





70 




75 

80 

Arg 

Val 

Lys 

Gly 

Ser 

Leu 

He 

He Ser 

Met 

Gly Val 

Phe Leu Asn Leu 





85 




90 


95 

He 

Gly 

Val 

Phe 

Asp 

Glu 

Val 

Tyr Gly 

Trp 

lie His 

Phe Leu Val Ser 




100 




105 



110 

Val 

Leu 

Phe 

Phe 

Leu 

Ser 

lie 

He Ala 

Tyr 

Phe lie 

Ala lie Ser lie 



115 





120 



125 

Leu 

Asp 

Lys 

Ser 

Trp 

He 

Ala 

Val Leu 

Leu 

lie lie 

Gly His lie Ala 


130 





135 



140 


Met 

Trp 

Tyr 

Leu 

His 

Phe 

Ala 

Ser Glu 

lie 

Pro Arg 

Gly Ala Ala lie 

145 





150 




155 

160 

Pro 

Glu 

Leu 

Leu 

Ala 

Val 

Phe 

Ser Phe 

Leu 

Pro Phe 

Tyr He Arg Asp 





165 




170 


175 

Tyr 

Phe 

Lys 

Ser 

Tyr 

Thr 

Lys Arg 







180 









20 

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

25 (A) LENGTH: 346 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

30 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 


35 



Met 

Lys 

Leu 

Leu 

Glu 

Pro 

Thr 

Asn 

Thr 

Ser 

Tyr 

Thr Leu 

Leu Gin 

Asp 


1 




5 





10 



15 



Leu 

Ala 

Leu 

His 

Phe 

Ala 

Phe 

Tyr 

Trp 

Phe 

Leu 

Ala Val 

Tyr Thr 

Trp 





20 





25 




30 

40 

Leu 

Pro 

Gly 

Val 

Leu 

Val 

Arg 

Gly 

Val 

Ala 

Val 

Asp Thr 

Gly Val 

Ala 



35 





40 




45 




Arg 

Val 

Pro 

Gly 

Leu 

Gly 

Arg Arg 

Gly 

Lys 

Arg 

Leu Leu 

Leu Ala 

Ala 



50 





55 





60 




Val 

Ala 

Val 

Leu 

Ala 

Leu 

Val 

Val 

Ser 

Val 

Val 

Val Pro 

Ala Tyr 

Val 


65 





70 





75 



80 

45 

Ala 

Tyr 

Ser 

Ser 

Leu 
85 

His 

Pro 

Glu 

Ser 

Cys 
90 

Arg 

Pro Val 

Ala Pro 
95 

Glu 


Gly 

Leu 

Thr 

Tyr 

Lys 

Glu 

Phe 

Ser 

Val 

Thr 

Ala 

Glu Asp Gly Leu 

Val 





100 





105 




110 



Val 

Arg 

Gly 

Trp 

Val 

Leu 

Gly 

Pro 

Gly 

Ala Gly Gly Asn 

Pro Val 

Phe 

50 



115 





120 




125 




Val 

Leu 

Met 

His 

Gly 

Tyr 

Thr Gly 

Cys 

Arg 

Ser 

Ala Pro 

Tyr Met 

Ala 



130 





135 





140 




val 

Leu 

Ala 

Arg 

Glu 

Leu 

Val 

Glu 

Trp 

Gly Tyr 

Pro Val 

Val Val 

Phe 


145 





150 





155 



160 

55 

Asp 

Phe 

Arg 

Gly 

His 

Gly 

Glu 

Ser 

Gly 

Gly 

Ser 

Thr Thr 

lie Gly 

Pro 





165 





170 



175 
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5 


Arg 

Glu 

Val 

Leu 
180 

Asp 

Ala Arg 

Ala Val 
185 

Val 

Gly 

Tyr 

Val 

Ser 
190 

Glu 

Arg 

Phe 

Pro 

Gly 
195 

Arg 

Arg 

lie He 

Leu Val 
200 

Gly 

Phe 

Ser 

Met 
205 

Gly 

Gly 

Ala 

Val 

Ala 
210 

lie 

Val 

Glu 

Gly Ala 
215 

Gly Asp 

Pro 

Arg 

Val 
220 

Tyr 

Ala 

Val 

Ala 

Ala 

Asp 

Ser 

Pro 

Tyr 

Tyr Arg 

Leu Arg 

Asp 

Val 

He 

Pro 

Arg 

Trp 

Leu 

225 





230 



235 





240 

Glu 

Tyr 

Lys 

Thr 

Pro 
245 

Leu Pro 

Gly Tip 

Val 
250 

Gly 

Val 

Leu 

Ala 

Gly 

255 

Phe 

Tyr 

Gly 

Arg 

Leu 
260 

Met 

Ala Gly 

Val Asp 
265 

Leu 

Gly 

Phe 

Gly 

Pro 
270 

Ala 

Gly 

Val 

Glu 

Arg 
275 

Val 

Asp 

Lys Pro 

Leu Leu 
280 

Val 

Val 

Tvr 

Gly 
285 

Pro 

Aro 

Asp 

Pro 

Leu 
290 

Val 

Thr 

Arg 

Asp Glu 
295 

Ala Arg 

Ser 

Leu 

Ala 
300 

Ser 

Arg 

Ser 

Pro 

Cys 

Gly 

Arg 

Leu 

Val 

Glu Val 

Pro Gly 

Ala 

Gly 

His 

Val 

Glu 

Ala 

Val 

305 





310 



315 





320 

Asp 

Val 

Leu 

Gly 

Pro 
325 

Gly Arg 

Tyr Ala 

Asp 
330 

Met 

Leu 

lie 

Glu 

Leu 
335 

Ala 

His 

Glu 

Glu 

Cys 
340 

Pro 

Pro Gly 

Ala Gly 
345 

Gly 








(2) INFORMATION FOR SEQ ID NO:35: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 
30 (0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 


40 


50 


55 


Met 

Pro 

Tyr 

Val 

Arg 

Asn 

Gly 

Gly Val 

Asn 

He 

Tyr 

Tyr 

Glu Leu 

Val 

1 




5 




10 




15 


Asp 

Gly 

Pro 

Glu 

Pro 

Pro 

He 

Val Phe 

Val 

His Gly 

Trp 

Thr Ala 

Asn 




20 




25 





30 


Met 

Asn 

Phe 
35 

Trp 

Lys 

Glu 

Gin 

Arg Arg 
40 

Tyr 

Phe 

Ala 

Gly 
45 

Arg Asn 

Met 

Met 

Leu 

Phe 

Val 

Asp 

Asn 

Arg 

Gly His 

Gly Arg 

Ser 

Asp 

Lys Pro 

Leu 


50 





55 




60 




Gly 

Tyr 

Asp 

Phe 

Tyr 

Arg 

Phe 

Glu Asn 

Phe 

He 

Ser 

Asp 

Leu Asp 

Ala 

65 





70 




75 




80 

Val 

Val 

Arg 

Glu 

Thr 
85 

Gly 

Val 

Glu Lys 

Phe 
90 

Val 

Leu 

val 

Gly His 
95 

Ser 

Phe 

Gly 

Thr 

Met 
100 

He 

Ser 

Met 

Lys Tyr 
105 

Cys 

Ser 

Glu 

Tyr 

Arg Asn 
110 

Arg 

Val 

Leu 

Ala 
115 

Leu 

He 

Leu 

He 

Gly Gly 
120 

Gly 

Ser 

Arg 

He 
125 

Lys Leu 

Leu 

His 

Arg 
130 

He 

Gly 

Tyr 

Pro 

Leu 
135 

Ala Lys 

He 

Leu 

Ala 
140 

Ser 

He Ala 

Tyr 


100 
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Lys 

Lys 

Ser 

Ser 

Arg 

Leu 

Val 

Ala Asp Leu Ser Phe Gly Lys Asn Ala 


145 





150 


155 lou 


Gly 

Glu 

Leu 

Lys 

Glu 

Trp 

Gly 

Trp Lys Gin Ala Met Asp Tyr Thr Pro 

5 





165 



170 175 


Ser 

Tyr 

val 

Ala 
180 

Met 

Tyr 

Thr 

Tyr Arg Thr Leu Thr Lys Val Asn Leu 
185 190 


Glu 

Asn 

He 
195 

Leu 

Glu 

Lys 

He 

Asp Cys Pro Thr Leu He He Val Gly 
200 205 

10 

Glu 

Glu 
210 

Asp 

Ala 

Leu 

Leu 

Pro 
215 

Val Ser Lys Ser Val Glu Leu Ser Arg 
220 


Arg 

He 

Glu 

Asn 

Ser 

Lys 

Leu 

Val He He Pro Asn Ser Gly His Cys 


225 





230 


235 240 


Val 

Met 

Leu 

Glu 

Ser 

Pro 

Ser 

Glu Val Asn Arg Ala Met Asp Glu Phe 

15 





245 



250 255 

He 

Ser 

Ser 

Ala 
260 

Gin 

Phe 




20 


(2) INFORMATION FOR SEQ ID NO:36: 
(i) SEQUENCE CHARACTERISTICS: 


(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 



Leu 

Arg 

Leu 

Arg 

Lys 

Phe 

Glu 

Glu He Asn Leu Val 

Leu Ser Gly Gly 

35 

1 




5 



10 

15 


Ala 

Ala 

Lys 

Gly 
20 

He 

Ala 

His 

He Gly Val Leu Lys 
25 

Ala He Asn Glu 
30 


Leu 

Gly 

He 
35 

Arg 

Val 

Arg 

Ala 

Leu Ser Gly Val Ser 
40 

Ala Gly Ala He 
45 

40 

Val 

Ser 
50 

Val 

Phe 

Tyr 

Ala 

Ser 
55 

Gly Tyr Ser Pro Glu 
60 

Gly Met Phe Ser 


Leu 

Leu 

Lys 

Arg 

Val 

Asn 

Trp 

Leu Lys Leu Phe Lys 

Phe Lys Pro Pro 


65 





70 


75 

80 


Leu 

Lys 

Gly 

Leu 

He 

Gly 

Trp 

Glu Lys Ala He Arg 

Phe Leu Glu Glu 

45 





85 



90 

95 

Val 

Leu 

Pro 

Tyr 
100 

Arg 

Arg 

He 

Glu Lys Leu Glu He 
105 

Pro Thr Tyr He 
110 


Cys 

Ala 

Thr 
115 

Asp 

Leu 

Tyr 

Ser 

Gly Arg Ala Leu Tyr 
120 

Leu Ser Glu Gly 
125 


Ser 

Leu 

He 

Pro 

Ala 

Leu 

Leu 

Gly Ser Cys Ala He 

Pro Gly He Phe 

50 


130 





135 

140 



Glu 

Pro 

Val 

Glu 

Tyr 

Lys 

Asn 

Tyr Leu Leu Val Asp 

Gly Gly He Val 


145 





150 


155 

160 


Asn 

Asn 

Leu 

Pro 

Val 
165 

Glu 

Pro 

Phe Gin Glu Ser Gly 
170 

He Pro Thr Val 
175 

55 

Cys 

Val 

Asp 

Val 

Leu 

Pro 

He 

Glu Pro Glu Lys Asp 

He Lys Asn He 




180 




185 

190 


Leu 

His 

He 
195 

Leu 

Leu 

Arg 

Ser 

Phe Phe Leu Ala Val 
200 

Arg Ser Asn Ser 
205 
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Glu Lys Arg Lys Glu Phe Cys Asp Leu Val He Val Pro Glu Leu Glu 

210 215 220 

Glu Phe Thr Pro Leu Asp Val Arg Lys Ala Asp Gin He Met Glu Arg 
. 225 230 235 240 

5 Gly Tyr He Lys Ala Leu Glu Val Leu Ser Glu 

245 250 

(2) INFORMATION FOR SEQ ID NO:37: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 



Met 

Phe 

Asn 

He 

Asn 

Val 

Phe 

val Asn 

He 

Ser Trp 

Leu 

Tyr 

Phe 

Ser 

25 

1 




5 




10 





15 



Gly 

He 

Val 

Met 
20 

Lys 

Thr 

Val 

Glu Glu 
25 

Tyr 

Ala 

Leu 

Leu 

Glu 
30 

Thr 

Gly 


Val 

Arg 

Val 

Phe 

Tyr 

Arg 

Cys 

Val He 

Pro 

Glu Lys 

Ala 

Phe 

Asn 

Thr 




35 





40 




45 




30 

Leu 

He 

He 

Gly 

Ser 

His 

Gly 

Leu Gly 

Ala 

His 

Ser 

Gly 

He 

Tyr 

He 


50 





55 




60 






Ser 

Val 

Ala 

Glu 

Glu 

Phe 

Ala 

Arg His 

Gly 

Phe Gly 

Phe 

Cys Met 

His 


65 





70 




75 





80 


Asp 

Gin 

Arg 

Gly 

His 

Gly 

Arg 

Thr Ala 

Ser 

Asp Arg 

Glu 

Arg Gly 

Tyr 






85 




90 





95 


35 

Val 

Glu 

Gly 

Phe 
100 

His 

Asn 

Phe 

He Glu 
105 

Asp 

Met 

Lys 

Ala 

Phe 
110 

Ser 

Asp 


Tyr 

Ala 

Lys 

Trp 

Arg 

Val 

Gly 

Gly Asp 

Glu 

He 

He 

Leu 

Leu Gly 

His 




115 





120 




125 





Ser 

Met 

Gly 

Gly 

Leu 

He 

Ala 

Leu Leu 

Thr 

Val 

Ala 

Thr 

Tyr 

Lys 

Glu 

40 


130 





135 




140 





He 
145 

Ala 

Lys 

Gly 

Val 

He 
150 

Ala 

Leu Ala 

Pro 

Ala 
155 

Leu 

Gin 

He 

Pro 

Leu 
160 


Thr 

Pro 

Ala 

Arg 

Arg 
165 

Leu 

Val 

Leu Ser 

Leu 
170 

Ala 

Ser 

Arg 

Leu 

Ala 
175 

Pro 

45 

His 

Ser 

Lys 

He 

Thr 

Leu 

Gin 

Arg Arg 

Leu 

Pro 

Gin 

Lys 

Pro 

Glu 

Gly 




180 




185 





190 




Phe 

Gin 

Arg 
195 

Ala 

Lys 

Asp 

He 

Glu Tyr 
200 

Ser 

Leu 

Ser 

Glu 
205 

He 

Ser 

Val 


Lys 

Leu 

Val 

Asp 

Glu 

Met 

He 

Lys Ala 

Ser 

Ser 

Met 

Phe 

Trp Thr 

He 



210 





215 




220 





50 

Ala 

Gly 

Glu 

He 

Asn 

Thr 

Pro 

Val Leu 

Leu 

He 

His 

Gly 

Glu 

Lys Asp 


225 





230 




235 





240 


Asn 

Val 

He 

Pro 

Pro 

Glu 

Ala 

Ser Lys 

Lys 

Ala Tyr 

Gin 

Leu 

He 

Pro 






245 




250 





255 



Ser 

Phe 

Pro 

Lys 

Glu 

Leu 

Lys 

He Tyr 

Pro 

Asp Leu 

Gly 

His 

Asn 

Leu 

55 




260 




265 





270 
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Phe Phe Glu Pro Gly Ala Val Lys He Val Thr Asp lie Val Glu Tip 

275 280 285 

Val Lys Asn Leu Pro Arg Glu Asn Pro 
290 295 


(2) INFORMATION FOR SEQ ID NO:38: 
10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 


30 


40 


45 


50 


Met 

Glu 

Val 

Tyr 

Lys 

Ala 

Lys 

Phe 

Gly 

Glu 

Ala Lys Leu 

Gly Trp 

Val 

1 




5 





10 




15 


Val 

Leu 

Val 

His 

Gly Leu 

Gly Glu 

His 

Ser 

Gly Arg 

Tyr 

Gly Arg 

Leu 




20 





25 





30 


He 

Lys 

Glu 

Leu 

Asn Tyr 

Ala 

Gly 

Phe 

Gly 

Val 

Tyr 

Thr 

Phe Asp 

Trp 



35 





40 





45 



Pro 

Gly His 

Gly 

Lys 

Ser 

Pro 

Gly 

Lys 

Arg 

Gly His 

Thr 

Ser Val 

Glu 


50 





55 





60 




Glu 

Ala 

Met 

Glu 

lie 

He 

Asp 

Ser 

He 

He 

Glu 

Glu 

He 

Arg Glu 

Lys 

65 





70 





75 




80 

Pro 

Phe 

Leu 

Phe 

Gly 
85 

His 

Ser 

Leu 

Gly 

Gly 
90 

Leu 

Thr 

Val 

He Arg 
95 

Tyr 

Ala 

Glu 

Thr 

Arg 

Pro Asp 

Lys 

He 

Arg 

Gly 

Leu 

lie 

Ala 

Ser Ser 

Pro 




100 





105 





110 


Ala 

Leu 

Ala 
115 

Lys 

Ser 

Pro 

Glu 

Thr 
120 

Pro 

Gly 

Phe 

Met 

Val 
125 

Ala Leu 

Ala 

Lys 

Phe 
130 

Leu 

Gly 

Lys 

He 

Ala 
135 

Pro 

Gly 

Val 

Val 

Leu 
140 

Ser 

Asn Gly 

He 

Lys 

Pro 

Glu 

Leu 

Leu 

Ser 

Arg 

Asn 

Arg 

Asp 

Ala 

Val 

Arg 

Arg Tyr 

Val 

145 





150 





155 




160 

Glu 

Asp 

Pro 

Leu 

Val 

His 

Asp Arg 

He 

Ser 

Ala 

Lys 

Leu 

Gly Arg 

Ser 





165 





170 




175 


He 

Phe 

Val 

Asn 

Met 

Glu 

Leu 

Ala 

His 

Arg 

Glu Ala Asp 

Lys He 

Lys 




180 





185 





190 


Val 

Pro 

He 
195 

Leu 

Leu 

Leu 

He 

Gly 
200 

Thr 

Gly 

Asp 

Val 

He 
205 

Thr Pro 

Pro 

Glu 

Gly 
210 

Ser 

Arg 

Arg 

Leu 

Phe 
215 

Glu 

Glu 

Leu 

Ala 

Val 
220 

Glu 

Asn Lys 

Thr 

Leu 

Arg 

Glu 

Phe 

Glu Gly 

Ala 

Tyr 

His 

Glu 

He 

Phe 

Glu 

Asp Pro 

Glu 

225 





230 





235 




240 

Trp 

Ala 

Glu 

Glu 

Phe 
245 

His 

Glu 

Thr 

He 

Val 
250 

Lys 

Trp 

Leu 

Val Glu 
255 

Lys 

Ser 

Tyr 

Ser 

Ser 
260 

Ala 

Gin 











55 

(2) INFORMATION FOR SEQ ID NO:39: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 249 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

10 


20 


25 


30 


40 


Leu 

He 

Gly 

Asn 

Leu 

Lys 

Leu 

Lys Arg Phe 

Glu 

Glu 

Val 

Asn 

Leu 

Val 

1 




5 



10 





15 


Leu 

Ser 

Gly 

Gly 

Ala 

Ala 

Lys 

Gly He Ala 

His 

He 

Gly 

Val 

Leu 

Lys 




20 




25 




30 



Ala 

Leu 

Glu 

Glu 

Leu 

Gly 

He 

Lys Val Lys 

Arg 

Leu 

Ser 

Gly 

Val 

Ser 



35 





40 



45 




Ala 

Gly 

Ala 

He 

Val 

Ser 

Val 

Phe Tyr Ala 

Ser 

Gly 

Tyr 

Thr 

Pro 

Asp 


50 





55 



60 





Glu 

Met 

Leu 

Lys 

Leu 

Leu 

Lys 

Glu Val Asn 

Trp 

Leu 

Lys 

Leu 

Phe 

Lys 

65 





70 



75 





80 

Phe 

Lys 

Thr 

Pro 

Lys 

Met 

Gly 

Leu Met Gly 

Trp 

Glu 

Lys 

Ala 

Ala 

Glu 





85 



90 





95 


Phe 

Leu 

Glu 

Lys 

Glu 

Leu 

Gly 

Val Lys Arg 

Leu 

Glu 

Asp 

Leu 

Asn 

He 




100 




105 




110 



Pro 

Thr 

Tyr 

Leu 

Cys 

Ser 

Ala 

Asp Leu Tyr 

Thr 

Gly 

Lys 

Ala 

Leu 

Tyr 



115 





120 



125 




Phe 

Gly 

Arg 

Gly 

Asp 

Leu 

He 

Pro Val Leu 

Leu 

Gly 

Ser 

Cys 

Ser 

He 


130 





135 



140 





Pro 

Gly 

He 

Phe 

Glu 

Pro 

Val 

Glu Tyr Glu 

Asn 

Phe 

Leu 

Leu 

Val 

Asp 

145 





150 



155 





160 

Gly 

Gly 

He 

Val 

Asn 

Asn 

Leu 

Pro Val Glu 

Pro 

Leu 

Glu 

Lys 

Phe 

Lys 





165 



170 





175 


Glu 

Pro 

He 

He 

Gly 

Val 

Asp 

Val Leu Pro 

He 

Thr 

Gin 

Glu 

Arg 

Lys 




180 




185 




190 



He 

Lys 

Asn 

He 

Leu 

His 

He 

Leu He Arg 

Ser 

Phe 

Phe 

Leu 

Ala 

Val 



195 





200 



205 




Arg 

Ser 

Asn 

Ser 

Glu 

Lys 

Arg 

Lys Glu Phe 

Cys 

Asn 

Val 

Val 

He 

Glu 


210 





215 



220 





Pro 

Pro 

Leu 

Glu 

Glu 

Phe 

Ser 

Pro Leu Asp 

Val 

Asn 

Lys 

Ala 

Asp 

Glu 

225 





230 



235 





240 

He 

Phe 

Cys 

Gly 

Asp 

Met 

Arg 

Ala Leu 








245 


(2) INFORMATION FOR SEQ ID NO:40: 
45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
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Met 

Pro 

Ala 

Asn Asp 

Ser 

Pro Thr He Asp Phe Asn 

Pro Arg 

Glv 

He 

1 



5 



10 



15 


Leu 

Arg Asn 

Ala Hi <; 

Ala 

Gin Val He 

Leu Ala 

Thr 

Ser Gly 

Leu 

A TTT 




20 


25 



30 



Lys 

Ala 

Phe 

T.^n T.uq 

-U W KA XJ y o 

Arg 

Thr His Lys 

Ser Tyr 

Leu 

Ser- Thr 

Ala 

Gin 



jj 



40 



45 



irp 

T on 

Glu 

T Ail ficn 

JUcu no^ 

Ala 

Gly Asn Gly Val Thr Leu 

Ala Gly 

Glu 







55 


60 




Asn 

Thr 

Ala 

Prn TV ) a 

nil 

Ala Ser Ser 

Ser His 

Pro 


Lys 

As n 





70 


75 




so 

inr 

Leu 

Val 

Tin U a 1 
lie Veil 

Leu 

His Gly Trp Glu Gly Ser 

OxiL. Old 


Al a 

mx a 







90 





Tyr 

Ala 

Thr 

Ser Ala 

laJLy 

Ser Thr Leu 

Phe Asp Asn 

uiy riixS 

Asp 

mr 




IvU 


105 



i in 

11U 



Dho 

Arg 

Leu 

& en Dh a 
Mail tile 

Arg 

Asp His Gly Asp Thr 

Tyr 

His Leu 

Asn 

Arg 



115 



120 



IOC 




He 

Phe 

rtsn Oct 

c o -r 

Leu He Asp Glu Val 

Val 

vaXy nla 

Val 

Lys 


130 




135 


140 




ALB 

He 

Gin 

k^in bin 

inr 

Asp Tyr Asp Lys Tyr Cys 

T ail ftAo^ 

Leu net. 

«xy 

.pne 

1 






155 




l fin 

low 

ser 

Leu Gly 

Gly Asn 

pne 

Ala Leu Arg 

Val Ala 

Val 

Arg Glu 

bin 

HIS 




IDj 



170 



X / D 


Leu 

Ala 

Lys 

Pro Leu 

Ala 

Gly Val Leu Ala Val 

Cys 

pro val 

Leu 

ASp 




loU 


185 



iyu 



Pro 

Ala 

His 

Tnr Met 

Met 

Ala Leu Asn Arg Gly Ala 

pne pne 

Tyr 

Gly 



195 



200 






Arg 

Tyr 

Phe 

Al a HI s 

T i m 

Lys 

Trp Lys Arg 

Ser Leu 

Thr 

Ala Lys 

Leu 

Ala 


210 




215 


220 




Ala 

Phe 

Pro 

Asp Tyr 

Lys 

Tyr Gly Lys Asp Leu 

Lys 

ser lie 

His 

Thr 





OTA 


235 




O A ft 
£. H \J 

Leu 

Asp Glu 

Leu Asn 

Asn 

Tyr Phe He 

Pro Arg 

Tyr 

inr oiy 

pne 

Asn 




OA 



250 



O 


Ser 

Val 

Ser 

Glu Tyr 

Phe 

Lys Ser Tyr 

Thr Leu 

Thr 

Gly Gin 

Lys 

Leu 






265 






Ala 

Phe 

Leu 

Asn Cys 

Pro 

Ser Tyr He 

Leu Ala 

Ala 

Gly Asp 

Asp 

Pro 



275 



280 



285 



He 

He 

Pro 

Ala Ser 

Asp 

Phe Gin Lys 

He Ala 

Lys 

Pro Ala 

Asn 

Leu 


290 




295 


300 




His 

He 

Thr 

Val Thr 

Gin 

Gin Gly Ser 

His Cys 

Ala 

Tyr Leu 

Glu 

Asn 

305 




310 


315 




320 

Leu 

His 

Lys 

Pro Ser 

Ala 

Ala Asp Lys 

Tyr Ala 

Val 

Lys Leu 

Phe 

Gly 




325 



330 



335 


Ala 

Cys 











(2) INFORMATION FOR SEQ ID NO:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
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Met 


Asp 

Met 

Pro 

lie 

Asp 

Pro 

Val 

Tyr Tyr Gin 

lieu 

Ala 
Ala 


Tyr 


1 




5 





10 




15 



Phe 

Asp 

Ser 

Leu 

Pro 

Lys 

Phe 

Asp 

Gin 

Phe 

Ser Ser 

Al 3 

nxa 

Arg 

V71 U 

iyr 





20 





25 








Arg 


Ala 
nxa 


Asn 

/Ai. y 

Tie* 


Glu 

Glu Arg Asn 

Arg 

ulu 

Leu 

oer 




35 





40 




45 





bin 

nis 

ulU 

Arg 

Val 

r-i 
ulu 

Arg 

val 

m ii 

ulu 

Asp Arg Thr 

xie 

Lys 

Giy 

Arg 



50 





55 




60 





10 

Asn 

Giy 

Asp 

11c 

Arg 

\/ a l 

vax 

Arg 

Val 

Tyr 

Gin Gin Lys 

fro 

ASp 

On V 

ber 

Pro 






* w 





75 




Qft 


vax 

Leu. 

vax 

Tyr 

Tyr 
85 

MIS 

uiy 

ui y 

uiy 

Phe 
90 

Val He 

Cys 

&er 

Ti- 
ne 

95 

GlU 


ber 

HIS 

Asp 

ax a 

Leu 

Cys 

Arg 

Arg 

i ie 

Ala Arg Leu 

C A V 

ber 

Asn 

Ser 

Thr 





100 





105 




110 



15 


vax 

Ser 

1 1 c 
HO 

vax 

Asp 

Tyr 

Arg 

Leu 

T Oft 

xzu 

Ala 

Aia 

Pro 

Glu His 

Lys 

Pne 

Pro 

Ala 


ax a 

vax 
130 

Tyr 

Asp 

Cys 

Tyr 

Asp 
135 

Ala 


Lys 

Trp Val 
140 

ax a 

GlU 

Asn 

Ala 


VjlU 


Leu 

Axg 

T 1 q 

xxe 

Asp 

Pro 

Ocl 

Lys 

He 

Phe Val 

biy 

Gly 

Asp 

Ser 

20 






lou 





155 




1 bU 


TV 1 - 

Aia 

Gly 

Giy 

Asn 

Leu 
165 

Ala 

Aia 

Ala 

vax 

Ser 
170 

He Met 

Ala 

Arg 

Asp 
175 

Ser 


Giy 


Asp 

irne 
180 

ixe 

Lys 

MIS 

bin 

lie 
185 

Leu 

He Tyr 

Pro 

Val 
190 

Val 

Asn 

25 

Pne 

Val 

Ala 

Pro 

Thr 

Pro 

Ser 

Leu 

Leu 

Glu 

Phe Gly 

Glu 

Gly 

Leu 

Trp 



iyj 





O Aft 









lie 

Leu 
zxu 

Asp 

Gin 

Lys 

lie 

Met 

Ser 

Trp 

Phe 

Ser Glu 
220 

Gin 

Tyr 

Phe. 

.Ser 


Arg 

Glu 

Glu 

Asp 

Lys 

Phe 

Asn 

Pro 

Leu 

Ala 

Ser Val 

He 

Phe 

Ala 

Asp 







9 n 





235 





30 

Leu 

Glu 

Asn 

Leu 

Pro 
245 

Pro 

Ala 

Leu 

He 

He 
250 

Thr Ala 

Glu 

Tyr 

Asp 
255 

Pro 


Leu 

Arg 

Asp 

Glu 
260 

Gly 

Glu 

Val 

Phe 

Gly 
265 

Gin 

Met Leu 

Arg 

Arg 
270 

Ala 

Gly 


Val 

Glu 

Ala 

Ser 

lie 

Val 

Arg 

Tyr 

Arg 

Gly Val Leu 

His 

Gly 

Phe 

He 

35 



275 





280 




285 




Asn 

Tyr 

Tyr 

Pro 

Val 

Leu 

Lys 

Ala 

Ala 

Arg Asp Ala 

He 

Asn 

Gin 

He 



290 





295 




300 






Ala 

Ala 

Leu 

Leu 

Val 

Phe 

Asp 










305 310 


40 

(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

45 (A) LENGTH: 305 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

50 

(v) FRAGMENT TYPE: internal 

(xr) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

55 
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Met 

Pro 

Leu 

Asp 

Pro Arg 

lie 

Lys Lys 

Leu Leu 

Glu 

Ser 

Ala 

Leu 

Thr 


1 




5 



10 




15 



lie 

Pro 

He 

Gly 

Lys Ala 

Pro 

Val Glu 

Glu Val 

Arg 

Lys 

lie 

Phe 

Arg 

5 







25 




30 



Gin 

Leu 

Ala 
35 

Ser 

Ala Ala 

Pro 

Lys Val 
40 

Glu Val 

Gly 

Lys 
45 

Val 

Glu 

Asp 


lie 

Lys 
50 

He 

Pro 

Gly Ser 

Glu 
55 

Thr Val 

lie Asn 

Ala 
60 

Arg 

Val 

Tyr 

Phe 

10 

Pro 
65 

Lys 

Ser 

Ser 

Gly Pro 
70 

Tyr 

Gly Val 

Leu Val 
75 

Tyr 

Leu 

His 

Gly 

Gly 
80 


Gly 

Phe 

Val 

lie 

Gly Asp 
85 

Val 

Glu Ser 

Tyr Asp 
90 

Pro 

Leu 

Cys 

Arg 
95 

Ala 


lie 

Thr 

Asn 

Ala 

Cys Asn 

Cys 

Val Val 

Val Ser 

Val 

Asp 

Tyr 

Arg 

Leu 

15 




100 



105 




110 



Ala 

Pro 

Glu 

Tyr 

Lys Phe 

Pro 

Ser Ala 

Val He Asp 

Ser 

Phe 

Asp 

Ala 




115 




120 



125 





Thr 

Asn 

Trp 

Val 

Tyr Asn 

Asn 

Leu Asp 

Lys Phe Asp 

Gly 

Lys 

Met 

Gly 



130 




135 



140 






Val 

Ala 

He 

Ala 

Gly Asp 

Ser 

Ala Gly 

Gly Asn Leu 

Ala 

Ala 

Val 

Val 

20 

145 




150 



155 





160 


Ala 

Leu 

Leu 

Ser 

Lys Gly 
165 

Lys 

lie Asn 

Leu Lys 
170 

Tyr 

Gin 

lie 

Leu 
175 

Val 


Tyr 

Pro 

Ala 

Val 
180 

Ser Leu 

Asp 

Asn Val 
185 

Ser Arg 

Ser 

Met 

lie 
190 

Glu 

Tyr 

25 

Ser 

Asp 

Gly 

Phe 

Phe Leu 

Thr 

Arg Glu 

His He 

Glu 

Trp 

Phe 

Gly 

Ser 



195 




200 



205 





Gin 

Tyr 

Leu 

Arg 

Ser Pro- Ala 

Asp Leu 

Leu Asp 

Phe 

Arg 

Phe 

Ser 

Pro 



210 




215 



220 






He 

Leu 

Ala 

Gin 

Asp Phe 

Asn 

Gly Leu 

Pro Pro 

Ala 

Leu 

lie 

He 

Thr 


225 




230 



235 





240 

30 

Ala 

Glu 

Tyr 

Asp 

Pro Leu 

Arg 

Asp Gin 

Gly Glu Ala 

Tyr 

Ala 

Asn 

Lys 






245 



250 




255 



Leu 

Leu 

Gin 

Ala 

Gly Val 

Ser 

Val Thr 

Ser Val Arg 

Phe 

Asn 

Asn 

Val 





260 



265 




270 




He 

His 

Gly 

Phe 

Leu Ser 

Phe 

Phe Pro 

Leu Met 

Glu 

Gin 

Gly 

Arg 

Asp 

35 



275 




280 



285 





Ala 

He 

Gly 

Leu 

He Gly Ser 

Val Leu 

Arg Arg Val 

Phe 

Tyr 

Asp 

Lys 



290 




295 



300 






He 













40 

305 














(2) INFORMATION FOR 

SEQ ID NO:43: 










(i) SEQUENCE CHARACTERISTICS: 


45 (A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 
55 (B) LOCATION: 1...603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
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20 


(A) LENGTH: 779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

10 (A) NAME/KEY: Coding Sequence 

(B) LOCATIONS. ..777 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

15 

ATG ATA AAA AAC TTC GAC AGA GAA AAT TCT AGC TTA GTA CTG TCC GGT 48 
Met lie Lys Asn Phe Asp Arg Glu Asn Ser Ser Leu Val Leu Ser Gly 
15 10 15 

GGT GGT GCT CTG GGT ATT GCT CAC TTG GGT GTA CTG CAT GAC CTT GAA 96 
Gly Gly Ala Leu Gly He Ala His Leu Gly Val Leu His Asp Leu Glu 
20 25 30 

AAA CAA AAT ATT GTA CCA AAT GAA ATT GTT GGT ACA AGT ATG GGT GGT 144 

Lys Gin Asn He Val Pro Asn Glu He Val Gly Thr Ser Met Gly Gly 
25 35 40 45 

ATC ATT GGT GCA TCT ATG GCT ATC GGG ATG AAA GAG AAA GAA ATA CTC 192 
He He Gly Ala Ser Met Ala He Gly Met Lys Glu Lys Glu He Leu 
50 55 60 

30 

GAA GAA ATC AAA AAC TTT TCC AAT GTC TTC AAC TGG ATA AAA TTC TCT 24 0 

Glu Glu He Lys Asn Phe Ser Asn Val Phe Asn Trp He Lys Phe Ser 
65 70 75 80 

TTT TCC GGT AAT TCT GTT GTC GAT AAC GAG AAG ATC GCT AAG ATA TTT 288 
Phe Ser Gly Asn Ser Val Val Asp Asn Glu Lys He Ala Lys He Phe 
85 90 95 

GAT ACT CTT TTT AAA GAC AGA AAG ATG ACA GAT ACG GTG ATC CCT CTT 336 
Asp Thr Leu Phe Lys Asp Arg Lys Met Thr Asp Thr Val He Pro Leu 
40 100 105 110 

AAA CTC ATC GCT ACA AAC TTA CAT AAT GGA CAT AAA AAA GTA TTT ACT 384 
Lys Leu He Ala Thr Asn Leu His Asn Gly His Lys Lys Val Phe Thr 
115 120 125 


35 


50 


55 
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10 


GCT TCG GAT GAT GTA CTG ATC AAA GAT GCA ATA CTC TCA ACA ATG GCA 432 
Ala Ser Asp Asp Val Leu lie Lys Asp Ala He Leu Ser Thr Met Ala 
130 135 140 

ATA CCC GGT GTA TTT GAA GAA CAT ATT ATT GAT GGT GAA ACC TAT GGC 480 
He Pro Gly Val Phe Glu Glu His He He Asp Gly Glu Thr Tyr Gly 
145 150 155 160 

GAC GGT TTT CTT TGT GAA AAC CTT GGT GTG AAT GAG GCA ACA TTC AAT 528 
Asp Gly Phe Leu Cys Glu Asn Leu Gly Val Asn Glu Ala Thr Phe Asn 
165 170 175 

GAT GTT TTA GCT GTA GAT GTC ATG GGT GAG AAC TCT TTT GAA AAA GCA 57 6 

'5 Asp Val Leu Ala Val Asp Val Met Gly Glu Asn Ser Phe Glu Lys Ala 

180 185 190 

ATG CCG GAC AAC TTC TTT AAA ACA TCA AAT GTT TTA GAA ATG TTT GAA 624 
Met Pro Asp Asn Phe Phe Lys Thr Ser Asn Val Leu Glu Met Phe Glu 
20 195 200 205 

AAA TCA ATG CGA CTT TTT ATT TAC AAC CAG ACA CAG ACA CAT ATT AAA 672 
Lys Ser Met Arg Leu Phe He Tyr Asn Gin Thr Gin Thr His He Lys 
210 215 220 

25 AAT GCA AAT AAA AAT ATT TAT CTT ATT GAA CCC GTT ACC AAA GAG TAT 720 

Asn Ala Asn Lys Asn He Tyr Leu He Glu Pro Val Thr Lys Glu Tyr 
225 230 235 240 

AAA ACA TTT CAA TTT CAT AAA CAT AAA GAG ATA CGT GCT TTA GGC TTG 768 
30 Lys Thr Phe Gin Phe His Lys His Lys Glu He Arg Ala Leu Gly Leu 

245 250 255 

GGT TTA CTG TG 779 
Gly Leu Leu 

35 

(2) INFORMATION FOR SEQ ID NO:45: 
(i) SEQUENCE CHARACTERISTICS: 

40 (A) LENGTH: 905 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 
so (B) LOCATION: 1...903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 
55 ATG CCC CTA CAT CCA AAG GTA AAG AAA TTA CTT TCC CAG CTA CCT CCC 48 
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Met Pro Leu His Pro Lys Val Lys Lys Leu Leu Ser Gin Leu Pro Pro 
1 5-10 15 

5 CAG GAC TTC TCC AGA AAC GTG CAG GAC CTG AGG AAG GCC TGG GAT TTA 96 

Gin Asp Phe Ser Arg Asn Val Gin Asp Leu Arg Lys Ala Trp Asp Leu 
20 25 30 

CCC TTC TGA GGG AGG AGG GAG ACC CTG AAG AGG GTT GAG GAC CTT GAG 144 
10 Pro Phe Ser Gly Arg Arg Glu Thr Leu Lys Arg Val Glu Asp Leu Glu 

35 40 45 

ATA CCC ACT AGG GAC GCA CGA ATC AGG GCC AGG GTC TAC ACC CCC TCA 192 
lie Pro Thr Arg Asp Ala Arg He Arg Ala Arg Val Tyr Thr Pro Ser 
15 50 55 60 

AGT AAG GAA AAC TTA CCC GTC CTT GTT TAC TAT CAC GGC GGT GGC TTC 240 
Ser Lys Glu Asn Leu Pro Val Leu Val Tyr Tyr His Gly Gly Gly Phe 
65 70 75 30 

20 GTG TTC GGT AGC GTT GAC AGC TAC GAC GGC CTC GCA TCC CTT ATT GCC 288 

Val Phe Gly Ser Val Asp Ser Tyr Asp Gly Leu Ala Ser Leu He Ala 
85 90 95 

AAG GAA TCT GGG ATT GCG GTT ATC TCC GTG GAG TAT AGG CTC GCC CCT 336 
25 Lys Glu Ser Gly lie Ala Val He Ser Val Glu Tyr Arg Leu Ala Pro 

100 105 110 

GAG CAC AAG TTC CCC ACC GCA GTC AAC GAC TCG TGG GAT GCG CTT CTC 384 
Glu His Lys Phe Pro Thr Ala Val Asn Asp Ser Trp Asp Ala Leu Leu 
115 120 125 

30 

TGG ATC GCG GAG AAC GGA GGC AAG CTG GGG CTC GAC ACC TCG AGA CTT 432 
Trp He Ala Glu Asn Gly Gly Lys Leu Gly Leu Asp Thr Ser Arg Leu 
130 135 140 

35 GCC GTG GCT GGG GAT AGT GCT GGA GGA AAC CTG TCT GCC GTG GTG TCC 480 

Ala Val Ala Gly Asp Ser Ala Gly Gly Asn Leu Ser Ala Val Val Ser 
145 150 155 160 


CTC CTG GAC AGG GAC CAG GGT AAG GGA CTG GTT AGT TAT CAG GTC CTA 528 
Leu Leu Asp Arg Asp Gin Gly Lys Gly Leu Val Ser Tyr Gin Val Leu 
165 170 175 

ATC TAC CCA GCA GTG AAC ATG GTC GAT AAC TCC CCA TCC GTC AGG GAG 57 6 

He Tyr Pro Ala Val Asn Met Val Asp Asn Ser Pro Ser Val Arg Glu 
180 185 190 

TAC GGC GAG GGA TAC TTC CTC ACC AGG TCC ATG ATG AAC TGG TTC GGG 624 
Tyr Gly Glu Gly Tyr Phe Leu Thr Arg Ser Met Met Asn Trp Phe Gly 
195 200 205 

50 ACC ATG TAC TTC TCC TCT GGA AGG GAA GCG GTA TCC CCC TAC GCC TCT 672 

Thr Met Tyr Phe Ser Ser Gly Arg Glu Ala Val Ser Pro Tyr Ala Ser 
210 215 220 


AO 


45 


55 
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CCA GCC TTG GCT GAC CTA CAT AAC CTC CCA CCC TCA CTG GTG ATC ACT 
Pro Ala Leu Ala Asp Leu His Asn Leu Pro Pro Ser Leu Val He Thr 
225 230 235 240 

GCA GAG TAT GAT CCC CTA AGG GAT CAG GGA GAG ACC TAC TCT CAC TCC 
Ala Glu Tyr Asp Pro Leu Arg Asp Gin Gly Glu Thr Tyr Ser His Ser 
245 250 255 

CTA AAC GAG GCT GGA AAC GTA TCA ACC TTG GTT AGA TAT CAA GGA ATG 
Leu Asn Glu Ala Gly Asn Val Ser Thr Leu Val Arg Tyr Gin Gly Met 
260 265 270 

ATT CAC GGC TTC CTG TCC TTC TAC GAG TGG ATA ACT GCC GGT AAA CTA 
lie His Gly Phe Leu Ser Phe Tyr Glu Trp lie Thr Ala Gly Lys Leu 
275 280 285 

GCC ATT CAC CAC ATT GCT GGG GTT CTG AGA TCT GTC CTT TA 
Ala He His His He Ala Gly Val Leu Arg Ser Val Leu 
290 295 300 


(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...975 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 


GTG GCC TTC TTC GAT ATG CCC CTT GAG GAA CTG AAA AAG TAC CGG CCT 
Val Ala Phe Phe Asp Met Pro Leu Glu Glu Leu Lys Lys Tyr Arg Pro 
15 10 15 

GAA AGG TAC GAG GAG AAA GAT TTC GAT GAG TTC TGG AGG GAA ACA CTT 
Glu Arg Tyr Glu Glu Lys Asp Phe Asp Glu Phe Trp Arg Glu Thr Leu 
20 25 30 

AAA GAA AGC GAA GGA TTC CCT CTG GAT CCC GTC TTT GAA AAG GTG GAC 
Lys Glu Ser Glu Gly Phe Pro Leu Asp Pro Val Phe Glu Lys Val Asp 
35 40 45 

TTT CAT CTC AAA ACG GTT GAA ACG TAC GAT GTT ACT TTC TCT GGA TAC 
Phe His Leu Lys Thr Val Glu Thr Tyr Asp Val Thr Phe Ser Gly Tyr 
50 55 60 
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5 


15 


20 


30 


40 


50 


AGG 

GGG 

GAG 

AGA 

ATA 

AAG 

GGC 

TGG 

CTT 

CTT 

GTT CCG 

AAG TTG 

GCG 

GAA 

240 

Arg 

Gly Gin 

Arg 

He 

Lys 

Gly 

Trp 

Leu Leu 

Val Pro Lys Leu 

Ala 

Glu 


65 





70 





75 



80 


GAA 

AAG 

CTT 

CCA 

TGC 

GTC 

GTG 

CAG 

TAC 

ATA 

GGT TAC 

AAT GGT 

GGA 

AGG 

288 

Glu 

Lys 

Leu 

Pro 

Cvs 

Val 

Val 

Gin 

Tyr 

He 

Gly Tyr Asn Gly 

Glv 

Arg 






85 





90 



95 



GGT 

TTT 

CCA 

CAC 

GAC 

TGG 

CTG 

TTC 

TGG 

CCG 

TCA ATG 

GGT TAC 

ATC 

TGT 

336 

Gly 

Phe 

Pro 

His 

Asp 

Trp 

Leu 

Phe 

Trp 

Pro 

Ser Met Gly Tyr 

He 

Cys 





100 





105 



110 




TTT 

GTC 

ATG 

GAC 

ACC 

AGG 

GGG 

CAG 

GGA 

AGC 

GGC TGG ATG AAG 

GGA 

GAC 

384 

Phe 

Val 

Met 

Asp 

Thr 

Arg 

Gly 

Gin 

Gly Ser 

Gly Trp Met Lys 

Gly 

Asp 




115 





120 




125 




ACA 

CCG 

GAT 

TAC 

CCT 

GAG 

GGT 

CCA 

GTC 

GAT 

CCA CAG 

TAC CCC 



H 3£ 

Thr 

Pro Asp 

Tyr 

Pro 

Glu 

Gly 

Pro 

Val 

Asp 

Pro Gin Tyr Pro 

Gly 

Phe 



130 





135 




140 





ATG 

ACG 

AGG 

GGC 

ATT 

CTG 

GAT 

CCG 

GGA 

ACC 

TAT TAC 

TAC AGG 

CGA 

GTC 

inn 

H O \J 

Met 

Thr Arg 

Gly 

He 

Leu 

Asp 

Pro 

Gly Thr 

Tyr Tyr Tyr Arg 

Arg 

Val 


145 





150 





155 



160 


1 1L 

GTG 

GAT 





ulu 

GAA 

GCA 

GCC ATT 

TCC TTC 



COO 

Phe 

Val 

Asp 

Ala 

Val 

Arg 

Ala 

Val 

Glu 

Ala 

Ala He 

Ser Phe 

Pro 

Arg 






165 





170 



175 




GAT 

TCC 


aan 




GCC 

GGA 

GGC AGT 

CAG GGT 



ci a 

D tO 

val 

Asp 

Ser 

Arg 

Lys 

val 

val 

Val 

Ala 

Gly 

Gly Ser Gin Gly 

Gly 

Gly 





180 





185 



190 





ccc 

CTT 






TCG 

AAC 

AGG GTG 

AAG GCT 

Liu 

Liu 

CO A 

He 

Pro 

Leu 

Ala 

Val 

Ser 

Ala 

Leu 

Ser 

Asn 

Arg Val 

Lys Ala 

Leu 

Leu 




195 





200 




205 




TGC 

GAT 

GTG 

CCG 

TTT 

CTG 

TGC 

CAC 

TTC 

AGA 

AGG GCC 

GTG CAA 



CIO 

Cys 

Asp 

Val 

Pro 

Phe 

Leu 

Cys 

His 

Phe Arg 

Arg Ala Val Gin 

Leu 

Val 



210 





215 




220 





GAC 

ACA 

CAC 

CCA 

TAC 

GTG 

GAG 

ATC 

ACC 

AAC 

TTC CTC 

AAA ACC 

CAC 


"79n 

Asp 

Thr 

His 

Pro 

Tyr 

val 

Glu 

He 

Thr 

Asn 

Phe Leu 

Lys Thr 

His 

Arg 


225 





230 





235 



240 


GAC 

AAA 

GAG 

GAG 

ATT 

GTT 

TTC 

AGA 

ACA 

CTT 

TCC TAC 

TTC GAT 


oik? 

ICR 
f DO 

Asp 

Lys 

Glu 

"jIU 

Tin 

lie 

Vol 


Arg 

Thr 

Leu 

Ser Tyr 

Phe Asp 

oxy 

If- 1 

val 






245 





250 



255 



AAC 

TTT 

GCA 

GCA 

AGG 

GCA 

AAG 

GTG 

CCC 

GCC 

CTG TTT 

TCC GTT 

GGG 

CTC 

816 

Asn 

Phe 

Ala 

Ala 

Arg 

Ala 

Lys 

Val 

Pro 

Ala 

Leu Phe 

Ser Val 

Gly 

Leu 





260 





265 



270 



ATG 

GAC 

ACC 

ATC 

TGT 

CCT 

CCC 

TCG 

ACG 

GTC 

TTC GCC 

GCT TAC 

AAC 

CAC 

864 

Met 

Asp 

Thr 

He 

Cys 

Pro 

Pro 

Ser 

Thr 

Val 

Phe Ala 

Ala Tyr 

Asn 

His 




275 





280 




285 





55 
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TAC GCC GGT CCA AAG GAG ATC AGA ATC TAT CCG TAC AAC AAC CAC GAA 912 
Tyr Ala Gly Pro Lys Glu lie Arg lie Tyr Pro Tyr Asn Asn His Glu 
290 295 300 

5 

GGT GGA GGT TCT TTC CAG GCA ATT GAG CAG GTG AAA TTC TTG AAG AGA 960 
Gly Gly Gly Ser Phe Gin Ala lie Glu Gin Val Lys Phe Leu Lys Arg 
305 310 315 320 

W CTA TTT GAG GAA GGC TAG 978 

Leu Phe Glu Glu Gly 
325 


15 


45 


50 


(2) INFORMATION FOR SEQ ID NO:47: 
(i) SEQUENCE CHARACTERISTICS: 


(A) LENGTH: 897 base pairs 

(B) TYPE: nucleic acid 

20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...894 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

ATG CGC ACC CTC TCC TTC GGT CCG ATG ACC ACA GGG GGA AGC ATT CAC 48 
Met Arg Thr Leu Ser Phe Gly Pro Met Thr Thr Gly Gly Ser lie His 
35 1 5 10 15 

ATG GCG ACC ATG GAC GTG ATG CGC GGG CCG GGG ATG CAG CGG CTG TCA 96 
Met Ala Thr Met Asp Val Met Arg Gly Pro Gly Met Gin Arg Leu Ser 
20 25 30 

« CAG GGC GCC AGG GAG GCC GCG AAC CAC CCC TGG GCG AAG CGA CTG GGC 144 

Gin Gly Ala Arg Glu Ala Ala Asn His Pro Trp Ala Lys Arg Leu Gly 
35 40 45 


CGC ATG GGC TAC GCG GCC AAG GGC GCC GTG TAC GCC ATC ATC GGC GTG 192 

Arg Met Gly Tyr Ala Ala Lys Gly Ala Val Tyr Ala lie lie Gly Val 
50 55 60 

CTC GCG CTG AAG CTC GCG GCG GGC GAG GGC GGC CGG ACC ACG GAC AGC 240 

Leu Ala Leu Lys Leu Ala Ala Gly Glu Gly Gly Arg Thr Thr Asp Ser 
65 70 75 80 

CAC GGC GCG GTG AAC ACC GTG GCG CAC GGG CCC TTC GGC GTC GCG CTG 288 

His Gly Ala Val Asn Thr Val Ala His Gly Pro Phe Gly Val Ala Leu 
85 90 95 

55 CTG GCG GTG CTG GTG GTG GGC CTG CTG GGC TAC GTG GTC TGG AGG TTC 336 

Leu Ala Val Leu Val Val Gly Leu Leu Gly Tyr Val Val Trp Arg Phe 

100 105 110 
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GCC CAG GCC TTC GTG GAC ACG GAG GAC AAG GGC TCC GAC GCG AAG GGA 
Ala Gin Ala Phe Val Asp Thr Glu Asp Lys Gly Ser Asp Ala Lys Gly 
115 120 125 


384 


10 


ATC GCC ACG CGC GCC ATG TAC TTC CTC AGC GGC TGC ATC TAC GCG TCG 432 
He Ala Thr Arg Ala Met Tyr Phe Leu Ser Gly Cys He Tyr Ala Ser 
130 135 140 

CTG GCC TTC TTC GCC GCG CAG TCC CTG GTG GGC GCC GCG CAC GGC CGG 480 
Leu Ala Phe Phe Ala Ala Gin Ser Leu Val Gly Ala Ala His Gly Arg 
145 150 155 160 


15 AGC AAG GGG ACG CAG GGC TGG ACG GCC ACG CTG ATG GAG CAG CCC TTT 

Ser Lys Gly Thr Gin Gly Trp Thr Ala Thr Leu Met Glu Gin Pro Phe 
165 170 175 


528 


20 


GGC CGC GTG CTG GTG GCG CTG GTG GGG CTG GGC ATC GTG GGC TTC GCG 
Gly Arg Val Leu Val Ala Leu Val Gly Leu Gly He Val Gly Phe Ala 
180 185 190 


576 


25 


CTG AAG CAG TTC CAC ACC GCG TGG AAG GCG AAG TTC CGG GAG AAG CTC 624 
Leu Lys Gin Phe His Thr Ala Trp Lys Ala Lys Phe Arg Glu Lys Leu 
195 200 205 

ACC CTC ACC GGA CTG GCT GCC CGG AAG CAG CAC CAC ATC GAG CGC ATG 672 
Thr Leu Thr Gly Leu Ala Ala Arg* Lys Gin His His He Glu Arg Met 
210 215 220 


30 


TGC CAG TTC GGC ATC GCC GCG CGC GGC GTG GTG TTC GCC GTC ATC GGC 
Cys Gin Phe Gly He Ala Ala Arg Gly Val Val Phe Ala Val He Gly 
225 230 235 240 


720 


35 


GGC TTC CTC GTC CGC TCC GCC GTG GAC GCG AAC CCC GGC GAG GCC AAG 
Gly Phe Leu Val Arg Ser Ala Val Asp Ala Asn Pro Gly Glu Ala Lys 
245 250 255 


768 


40 


GGC CTG GGA GAG GCC CTG GCC GTC GTC GCG AGG CAG CCG TCC GGC GAC 816 

Gly Leu Gly Glu Ala Leu Ala Val Val Ala Arg Gin Pro Ser Gly Asp 
260 265 270 

GTG CTC CTG GGG GTG GTG GCG GCG GGC CTG GTG GCC TAC GCC GCC TAC 864 

Val Leu Leu Gly Val Val Ala Ala Gly Leu Val Ala Tyr Ala Ala Tyr 

275 280 285 


45 


CTG TTC CTC CAG GCG CGC TAC CGC GAA CTC TAG 
Leu Phe Leu Gin Ala Arg Tyr Arg Glu Leu 
290 295 


897 


50 


(2) INFORMATION FOR SEQ ID NO:48: 
(r) SEQUENCE CHARACTERISTICS: 


(A) LENGTH: 914 base pairs 

(B) TYPE: nucleic acid 

55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 


115 
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(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...912 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 


ATG AGC AAA TTC GCA ATA CTC TGG GCG TTG ATA ACG GCA TAC CTG CCG 48 
Met Ser Lys Phe Ala lie Leu Trp Ala Leu lie Thr Ala Tyr Leu Pro 
15 10 15 

GAA CCT GTG ATG AAA CTG GTA TAT TTA GGG CGG CGC GAA ACG CTT GGG 96 
Glu Pro Val Met Lys Leu Val Tyr Leu Gly Arg Arg Glu Thr Leu Gly 
20 25 30 

GCA CGG ACG CTT GAC GTT AAA GCC CAA GCT GTC GGG CGG CTG GCC AAT 144 
Ala Arg Thr Leu Asp Val Lys Ala Gin Ala Val Gly Arg Leu Ala Asn 
35 40 45 

20 GCA ACA AGA CCT GTC GGG GTG ATT CCG ACG GTC GAG GAA' AGC CGG AAG 192 

Ala Thr Arg Pro Val Gly Val lie Pro Thr Val Glu Glu Ser Arg Lys 
50 55 60 

r 

ATG ACG GAT AAA GCC GTT AGC CTT TTT GAT CAG CCC GCC CCC GAA TTA 240 
25 Met Thr Asp Lys Ala Val Ser Leu Phe Asp Gin Pro Ala Pro Glu Leu 

65 70 75 80 

TTC CGT AAA AAA GAC ATT CAG ATT GAC GGG GCT GAA GGG CCT ATT GAT 288 
Phe Arg Lys Lys Asp lie Gin lie Asp Gly Ala Glu Gly Pro lie Asp 
85 90 95 


10 


15 


30 


35 


40 


45 


50 


55 


GCC CGT ATT TAC AGC GGC CCT, GCA AAA CAT CGC CCR CGR CCA ATW CTA . 336 
Ala Arg lie Tyr Ser Gly Pro Ala Lys His Arg Xaa Xaa Pro Xaa Leu 
100 105 110 

GTG TAT TTT CAC GGC GGT GGC TGG GTT CAG GGC AAT CTG GAC AGC CAT 384 
Val Tyr Phe His Gly Gly Gly Trp Val Gin Gly Asn Leu Asp Ser His 
115 120 125 

GAC GGG GTT TGC GGC AAG CTG GCA AAA TGG GCG AAC TGC ATT GTT ATC 432 
Asp Gly Val Cys Gly Lys Leu Ala Lys Trp Ala Asn Cys lie Val lie 
130 135 140 

TCG GTC GAT TAT CGT CTA GCG CCC GAA CAC AAA TTT CCT TGT GCG CCG 480 
Ser Val Asp Tyr Arg Leu Ala Pro Glu His Lys Phe Pro Cys Ala Pro 
145 150 155 160 

CTT GAT GCG ATT GCG GCC TAT AAA TGG GTG CGC GCC AAC GCA ACA AAC 528 
Leu Asp Ala He Ala Ala Tyr Lys Trp Val Arg Ala Asn Ala Thr Asn 
165 170 175 

CTT GGC GGC GAT CCT GAA CGT ATC GGC GTT GGC GGC GAT AGC GCA GGG 576 
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20 


Leu Gly Gly Asp Pro Glu Arg lie Gly Val Gly Gly Asp Ser Ala Gly 
180 185 190 

5 GGC AAT CTT GCC GCC GTT GTC TGC CAA CAA ACC GCC ATG AAC GGC GAG 624 

Gly Asn Leu Ala Ala Val Val Cys Gin Gin Thr Ala Met Asn Gly Glu 
195 200 205 

CGC ACA CCA GAT CTG CAA GTC CTG ATC TAT CCG GCG CTG GAT GCA CGC 672 
10 Arg Thr Pro Asp Leu Gin Val Leu He Tyr Pro Ala Leu Asp Ala Arg 

210 215 220 

ATG ATC TCG ACC TCG ATG GAG GAA TTG CGT GAT GCC TAC ATC TTG CCG 720 
Met He Ser Thr Ser Met Glu Glu Leu Arg Asp Ala Tyr He Leu Pro 
15 225 230 235 240 

AAA TCC AGA ATG GAG TAT TTC CTC GGC CTA TAT ACG CGT GGC CCT GAC 768 
Lys Ser Arg Met Glu Tyr Phe Leu Gly Leu Tyr Thr Arg Gly Pro Asp 
245 250 255 

GAT ATC GAG GAC CTT AGG ATG TCG CCA ATT CTC AGG GAT ACC GTC GCG 816 
Asp He Glu Asp Leu Arg Met Ser Pro He Leu Arg Asp Thr Val Ala 
260 265 270 

GAT CAA CCC CAA GCC TGC ATT GTC ACC TGT GGG TTT GAC CCT GCG CGA 864 
25 Asp Gin Pro Gin Ala Cys He Val Thr Cys Gly Phe Asp Pro Ala Arg 

275 280 285 

CGA CGG GAA CAC CTA CGC CGA ACG CTT AAT TGC CGA GGG GAT AGA CGT 912 
Arg Arg Glu His Leu Arg Arg Thr Leu Asn Cys Arg Gly Asp Arg Arg 
290 295 300 

30 

TA 914 
(2) INFORMATION FOR SEQ ID NO:49: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 926 base pairs 

(B) TYPE: nucleic acid 

40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
45 (ix) FEATURE: 


(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...924 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 


GTG AGC ATT CGT CTG CGA CTG TTA AAC TGG TTT TTG AAT ACC TTT GAA 48 
Val Ser He Arg Leu Arg Leu Leu Asn Trp Phe Leu Asn Thr Phe Glu 
15 10 15 


55 


AAA CCA AAA CTG GCC GCG GCC AAA ACG CCG GAT GAT TTG CGA AAA TCG 96 
Lys Pro Lys Leu Ala Ala Ala Lys Thr Pro Asp Asp Leu Arg Lys Ser 
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20 


20 25 30 

5 TTT GAA TTA AAG GCG AGG TTT TTG TTT CCG GCG CCA CGT AAA ACA AGG 144 

Phe Glu Leu Lys Ala Arg Phe Leu Phe Pro Ala Pro Arg Lys Thr Arg 
35 40 45 

TTT AGT CAT GAT GTA TTG CAG TCA GGC ATC GGG TCG GTA AAT GCC CAG 192 
Phe Ser His Asp val Leu Gin Ser Gly He Gly Ser Val Asn Ala Gin 
10 50 55 60 

TGG GCG AAA TCC AAA TCT GCA TCT GAT GAC AGG GTA ATC CTG TAT TTT 240 
Trp Ala Lys Ser Lys Ser Ala Ser Asp Asp Arg Val He Leu Tyr Phe 
65 70 75 80 

15 

CAT GGG GGA GGG TAT GTT TTT GGG TCA CCA AAA ACG CAC CGT GCA ATG 288 
His Gly Gly Gly Tyr Val Phe Gly Ser Pro Lys Thr His Arg Ala Met 
85 90 95 

TTG GCG CGC TTG TCG GCA ATG ACA GGT CTT TCT GCG TGC CTT CCA GAT 336 
Leu Ala Arg Leu Ser Ala Met Thr Gly Leu Ser Ala Cys Leu Pro Asp 
100 105 110 

TAT AGG TTG GCA CCA GAG CAC CCA TTT CCA GCC GCG ATC GAA GAT GCA 384 
Tyr Arg Leu Ala Pro Glu His Pro Phe Pro Ala Ala He Glu Asp Ala 
25 H5 120 125 

GTT TTA TCG TAT AAA TGT TTA CTA GAG CGA GCA ATC GAG CCC CAA AAT 432 
Val Leu Ser Tyr Lys Cys Leu Leu Glu Arg Ala He Glu Pro Gin Asn 
130 135 140 

30 ATT ATA CTG GGG GGG GAC AGT GCT GGT GGC GGT TTG GTT CTT GCT TTG 480 

He He Leu Gly Gly Asp Ser Ala Gly Gly Gly Leu Val Leu Ala Leu 
145 150 155 160 

CTT GCA GAA ATC AAG GCC CAA TCC TTG CCC AAA CCT GCT GGC GTT TTT 528 
35 Leu Ala Glu lie Lys Ala Gin Ser Leu Pro Lys Pro Ala Gly Val Phe 

165 170 175 

GCC TTG TCG CCT TTG GTT GAT TTA TCA TTT TCG GGC CTT TCG TTT TCT 57 6 

Ala Leu Ser Pro Leu Val Asp Leu Ser Phe Ser Gly Leu Ser Phe Ser 
180 185 190 

40 

AAA AAT GCC CAA ACC GAT GTG ATG TTG CCC GCA TCA CGG GCT GCG GAT 624 

Lys Asn Ala Gin Thr Asp Val Met Leu Pro Ala Ser Arg Ala Ala Asp 
195 200 205 

45 ATG GCG ACC TTG TAT TTG GAT GGG GCC GAT GCA GAT GAT CCA CGT GCA 672 

Met Ala Thr Leu Tyr Leu Asp Gly Ala Asp Ala Asp Asp Pro Arg Ala 
210 215 220 

TCG CCG CTG CAG GCG GAT TTT TCT GGC ATG CCG CCT GTA TTT CTG ACA 720 
Ser Pro Leu Gin Ala Asp Phe Ser Gly Met Pro Pro Val Phe Leu Thr 
225 230 235 240 

GCA AGT GAC AGT GAA ATC CTG TTG GAT GAT TGC CTG CGG ATG GCG GAT 768 


50 


55 
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20 


Ala Ser Asp Ser Glu lie Leu Leu Asp Asp Cys Leu Arg Met Ala Asp 
245 250 255 

5 CAC TTG CGT GCG CAA GGT GTC GTT GTG ACA GAC CGG ATT GTT GAA AAC 816 

His Leu Arg Ala Gin Gly Val Val Val Thr Asp Arg He Val Glu Asn 
260 265 270 

CAT CCA CAT GTT TGG CAT ATT TTT CAA CGC CTT CTA CCC GAA GCA GAT 864 
w His Pro His Val Trp His He Phe Gin Arg Leu Leu Pro Glu Ala Asp 

275 280 285 

CAG GGG CTG CGG GCG ATT GCC GCG TGG ATT AAA CCT CTT TTA TCA GGT 912 
Gin Gly Leu Arg Ala He Ala Ala Trp He Lys Pro Leu Leu Ser Gly 
1S 290 295 300 

TCA AAC GAA AGC TA 926 

Ser Asn Glu Ser 

305 

(2) INFORMATION FOR SEQ ID NO:50: 
(i) SEQUENCE CHARACTERISTICS: 

25 (A) LENGTH: 713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: ■ - 

(A) NAME/KEY: Coding Sequence 
35 (B) LOCATION: 1...711 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:^ 

40 ATG CTT ACA TTT AAT GTT TTA TAT GGT ATG ATG AAA CAA AAA CTA GCA 48 

Met Leu Thr Phe Asn Val Leu Tyr Gly Met Met Lys Gin Lys Leu Ala 
15 10 15 


45 


50 


55 


GCA ATT CTC ATG TTT TTA GGG CTA TCA GCA GCA GAG GCT CAA GAC TGG 96 
Ala He Leu Met Phe Leu Gly Leu Ser Ala Ala Glu Ala Gin Asp Trp 
20 25 30 

CCT GAC CTA CAG AAA TAT CGT AGT GCT AAT AAA GAA GCC AAA TTA CTT 144 
Pro Asp Leu Gin Lys Tyr Arg Ser Ala Asn Lys Glu Ala Lys Leu Leu 
35 40 45 

CCA AAG GAA AAC CGG AAG GTG GTT TTT ATG GGC AAC TCC ATT ACA GAA 192 
Pro Lys Glu Asn Arg Lys Val Val Phe Met Gly Asn Ser He Thr Glu 
50 55 60 

GCC TGG ATT AGT CAG CGA CCT GAG TTT TTT AGT GAA AAT GGG TTT ATC 240 
Ala Trp He Ser Gin Arg Pro Glu Phe Phe Ser Glu Asn Gly Phe He 
65 70 75 80 
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5 


15 


25 


30 


35 


GGT 

CGA 

GGC 

ATC 

AGT 

GGC 

CAG 

ACA ACC 

CCT 

CAG 

ATG 

TTG 

TTG AGA 

TTC 

288 

Gly 

Arg 

Gly 

He 

Ser 

Gly 

Gin 

Thr 

Thr 

Pro 

Gin 

Met 

Leu 

Leu Arg 

Phe 






85 





90 




95 



CGA 

CAG 

GAT 

GTG 

ATA 

GAC 

CTG 

CAG 

CCA AAG 

GCT 

GTA 

GTG 

ATA CTA 

GCT 

336 

Arg 

Gin 

Asp 

Val 

He 

Asp 

Leu 

Gin Pro Lys Ala 

Val 

Val 

He Leu 

Ala 





100 





105 





110 



GGT 

ACC 

AAT 

GAC 

GTA 

GCT 

CAA 

AAT 

ACC 

GGG 

CCG 

ATG 

ACC 

ATT GAG 

GAA 

J04 

Gly 

Thr 

Asn 

Asp 

Val 

Ala 

Gin 

Asn Thr Gly Pro 

Met 

Thr 

lie Glu 

Glu 




115 





120 





125 




Ten 


Uv 1 

AAC 

ATT 

AAG 


ATG 

GTG 

GAG 

CTG 

Cirri 


GCC AAT 

GGG 


Ser 

Leu 

Ala 

Asn 

He 

Lys 

Ser 

Met 

Val 

Glu 

Leu 

Ala 

Gin 

Ala Asn Gly 



130 





135 





140 





AX V- 



GTT 

1 1 VJ 

I o 1 


GTG 

CTG 

CCT 

GCA 



TTC AGC 

TGG 

A O A 

He 

Thr 

Pro 

val 

Leu 

Cys 

Thr 

Val 

Leu 

Pro 

Ala 

Asp 

Arg 

Phe Ser 

Trp 


145 





150 





155 




160 


LbA 


rap 


■ML- A 



GAA 

ACT 

ATC 

ATT 


CTC 

AAT CAG 

CTC 

COO 

Arg 

Pro 

Glu 

Leu 

Thr 

Pro 

Ala 

Glu 

Thr 

He 

He 

Ala 

Leu 

Asn Gin 

Leu 






165 





170 




175 



ATT 

AAG 

CAA 

TAT 


bAG 

bCA 

CAG 

GGC 

CTG 

GCC 

CTG 

GTG 

GAT TAT 

CAT 

576 

He 

Lys 

Gin 

Tyr 

Ala 

Glu 

Ala 

Gin 

Gly Leu 

Ala 

Leu 

Val 

Asp Tyr 

His 





180 





185 





190 



GCT 

GCA 

CTC 

ACC 

AAT 

AAA 

GGT 

GGA 

GGA 

CTT 

CCG 

GTG 

AAA 

TAC GGA 

GAA 

624 

Ala 

Ala 

Leu 

Thr 

Asn 

Lys 

Gly 

Gly Gly Leu 

Pro 

Val 

Lys 

Tyr Gly Glu 




195 





200 





205 




GAT 

GGT 

GTG 

CAT 

CCA 

AAT 

GTA 

GCA 

GGC 

TAT 

CAG 

GTG 

ATG 

GAA AAC 

ATT 

672 

Asp 

Gly 

Val 

His 

Pro 

Asn 

Val 

Ala Gly Tyr Gin 

Val 

Met 

Glu Asn 

He 



210 





215 





220 





GTT 

TTA 

CCG 

GTC 

ATT 

TCC 

AGC 

GAG 

TTG 

GCA 

AAG 

CTG 

AAG 

TA 


713 

Val 

Leu 

Pro 

Val 

He 

Ser 

Ser 

Glu 

Leu 

Ala 

Lys 

Leu 

Lys 




225 





230 





235 







40 (2) INFORMATION FOR SEQ ID NO:51 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...975 

55 
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10 


15 


20 


35 


40 


<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

ATG GCC TTC TTC GAT TTA CCA CTC GAA GAA CTG AAG AAA TAT CGT CCA 48 
Met Ala Phe Phe Asp Leu Pro Leu Glu Glu Leu Lys Lys Tyr Arg Pro 
15 10 15 

GAG CGG TAC GAA GAG AAA GAC TTC GAT GAG TTC TGG GAA GAG ACA CTC 96 
Glu Arg Tyr Glu Glu Lys Asp Phe Asp Glu Phe Trp Glu Glu Thr Leu 
20 25 30 

GCA GAG AGC GAA AAG TTC CCC TTA GAC CCC GTC TTC GAG AGG ATG GAG 144 
Ala Glu Ser Glu Lys Phe Pro Leu Asp Pro Val Phe Glu Arg Met Glu 
35 40 45 

TCT CAC CTC AAA ACA GTC GAA GCG TAC GAT GTC ACC TTC TCC GGA TAC 192 
Ser His Leu Lys Thr Val Glu Ala Tyr Asp Val Thr Phe Ser Gly Tyr 
50 55 60 

AGG GGA CAG AGG ATC AAA GGG TGG CTC CTT GTT CCA AAA CTG GAA GAA 240 
Arg Gly Gin Arg lie Lys Gly Trp Leu Leu Val Pro Lys Leu Glu Glu 
65 70 75 80 


GAA AAA CTT CCC TGC GTT GTG CAG TAC ATA GGA TAC AAC GGT GGA AGA 288 
Glu Lys Leu Pro Cys Val Val Gin Tyr lie Gly Tyr Asn Gly Gly Arg 
25 85 90 95 

GGA TTC CCT CAC GAC TGG CTG TTC TGG CCT TCT ATG GGT TAC ATA TGT 336 
Gly Phe Pro His Asp Trp Leu Phe Trp Pro Ser Met Gly Tyr lie Cys 
100 105 110 

30 TTC GTC ATG GAT ACT CGA GGT CAG GGA AGC GGC TGG CTG AAA GGA GAC 384 

Phe Val Met Asp Thr Arg Gly Gin Gly Ser Gly Trp Leu Lys Gly Asp 
115 120 125 

ACA CCG GAT TAC CCT GAG GGT CCC GTT GAC CCT CAG TAT CCA GGA TTC 4 32 

Thr Pro Asp Tyr Pro Glu Gly Pro Val Asp Pro Gin Tyr Pro Gly Phe 
130 135 140 

ATG ACA AGA GGA ATA CTG GAT CCC AGA ACT TAC TAC TAC AGA CGA GTC 480 
Met Thr Arg Gly lie Leu Asp Pro Arg Thr Tyr Tyr Tyr Arg Arg Val 
145 150 155 160 

TTC ACG GAC GCT GTC AGA GCC GTT GAA GCT GCT GCT TCT TTT CCT CAG 528 
Phe Thr Asp Ala Val Arg Ala Val Glu Ala Ala Ala Ser Phe Pro Gin 
165 170 175 

45 GTA GAT CAA GAA AGA ATC GTG ATA GCT GGA GGC AGT CAG GGT GGC GGA 576 

Val Asp Gin Glu Arg lie Val lie Ala Gly Gly Ser Gin Gly Gly Gly 
180 185 190 

ATA GCC CTT GCG GTG AGC GCT CTC TCA AAG AAA GCA AAG GCT CTT CTG 624 
50 lie Ala Leu Ala Val Ser Ala Leu Ser Lys Lys Ala Lys Ala Leu Leu 

195 200 205 

TGC GAT GTG CCG TTT CTG TGT CAC TTC AGA AGA GCA GTA CAG CTT GTG 672 

55 
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Cys Asp Val Pro Phe Leu Cys His Phe Arg Arg Ala Val Gin Leu Val 
210 215 220 

GAT ACG CAT CCA TAC GCG GAG ATC ACG AAC TTT CTA AAG ACC CAC AGA 
Asp Thr His Pro Tyr Ala Glu lie Thr Asn Phe Leu Lys Thr His Arg 
22.5 230 235 240 

GAC AAG GAA GAA ATC GTG TTC AGG ACT CTT TCC TAT TTC GAT GGA GTG 
Asp Lys Glu Glu He Val Phe Arg Thr Leu Ser Tyr Phe Asp Gly Val 
245 250 255 

AAC TTC GCA GCC AGA GCG AAG ATC CCT GCG CTG TTT TCT GTG GGT CTC 
Asn Phe Ala Ala Arg Ala Lys He Pro Ala Leu Phe Ser Val Gly Leu 
260 265 270 

ATG GAC AAC ATT TGT CCT CCT TCA ACG GTT TTC GCT GCC TAC AAT TAC 
Met Asp Asn He Cys Pro Pro Ser Thr Val Phe Ala Ala Tyr Asn Tyr 
275 280 285 

TAC GCT GGA CCG AAG GAA ATC AGA ATC TAT CCG TAC AAC AAC CAC GAG 
Tyr Ala Gly Pro Lys Glu He Arg He Tyr Pro Tyr Asn Asn His Glu 
290 295 300 

GGA GGA GGC TCT TTC CAA GCG GTT GAA CAG GTG AAA TTC TTG AAA AAA 
Gly Gly Gly Ser Phe Gin Ala Val Glu Gin Val Lys Phe Leu Lys Lys 
305 310 315 320 


CTA TTT GAG AAA GGC TAA 
Leu Phe Glu Lys Gly 
325 


(2) INFORMATION FOR SEQ ID NO:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 661 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...630 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 


TTG AAG TAC TTC AAA GCC CGG CTT GCC GGC ATC ACC TTG CTC GGC CTG 
Leu Lys Tyr Phe Lys Ala Arg Leu Ala Gly He Thr Leu Leu Gly Leu 
15 10 15 

CTG GCC TGC ACC TCG GCC TCG GCG CAG ACC GAG CCC ATC GTG TTC GTG 
Leu Ala Cys Thr Ser Ala Ser Ala Gin Thr Glu Pro He Val Phe Val 
20 25 30 
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CAC GGC TAT TCC GGC AGC GCA TCC AAG TGG GAC ACC ATG CTG GGC CGC 
His Gly Tyr Ser Gly Ser Ala Ser Asn Trp Asp Thr Met Leu Gly Arg 
35 40 45 


144 


10 


TTC CGG TCG AAC GGT TAT GCG TCC GGC TCG CTC TAC ACC TTC AAC TAC 
Phe Arg Ser Asn Gly Tyr Ala Ser Gly Ser Leu Tyr Thr Phe Asn Tyr 
50 55 60 

AAC TCG TTG GTC AGC AGC AAC CGC ACC AGC GCC AGC GAG CTG CGC AGC 
Asn Ser Leu Val Ser Ser Asn Arg Thr Ser Ala Ser Glu Leu Arg Ser 
65 70 75 80 


192 


240 


15 


TTC GTC AAC ACC GTG CGT TCG CGC CAC GGC AAC GCC CGC ATC GCG CTG 
Phe Val Asn Thr Val Arg Ser Arg His Gly Asn Ala Arg lie Ala Leu 
85 90 95 


288 


GTC GCC CAC TCC AAC GGC GGG CTG GTG TCG CGC TGG TAT CGC GCG GAG 336 
Val Ala His Ser Asn Gly Gly Leu Val Ser Arg Trp Tyr Arg Ala Glu 
100 105 110 

CTG GGC GGC GAA ACG GCC ACC CGC CGC TTC GTG ACG CTG GGC ACG CCG 384 
Leu Gly Gly Glu Thr Ala Thr Arg Arg Phe Val Thr Leu Gly Thr Pro 
115 120 125 

CAC CGG GGC ACC ACC TGG GCC TAT GCG TGC TAC AGC CCC GCA TGT TTC 432 
His Arg Gly Thr Thr Trp Ala Tyr Ala Cys Tyr Ser Pro Ala Cys Phe 
130 135 140 


30 


GAG ATG CGC CCC GGC TCC AGC TTG CTG ACC ACG CTG GGC TCG CGT GCC 
Glu Met Arg Pro Gly Ser Ser Leu Leu Thr Thr Leu Gly Ser Arg Ala 
145 150 155 160 


480 


35 


TGC GAC CGC TCG CTG TGG TCG AAC ACC GAC GGC ATC ATC CTG CCG GCG 
Cys Asp Arg Ser Leu Trp Ser Asn Thr Asp Gly He He Leu Pro Ala 
165 170 175 


528 


40 


TCC AGC GCG CAG TGT GGT GTC AGC ACG CGC ACT GCC GAC GTC AGC CAT 576 
Ser Ser Ala Gin Cys Gly Val Ser Thr Arg Thr Ala Asp Val Ser His 
180 185 190 

CTC GAC CTG CTG ACC GAC TCT CGC GTG TAC ACG CAG TTG CGC ACG CAG 624 
Leu Asp Leu Leu Thr Asp Ser Arg Val Tyr Thr Gin Leu Arg Thr Gin 
195 200 205 

TTG CAA TGAGGGTGAC GGTGCACCGA ACGTGCACCT G 661 
45 Leu Gin 

210 


50 


(2) INFORMATION FOR SEQ ID NO:53: 
(i) SEQUENCE CHARACTERISTICS: 


(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 


Met 

Lys 

Val 

Lys 

nlS 

IT- 1 

vai 

lie 

vai 

Leu His Gly Leu 

Tyr 

Met 

Ser 

Gly 

1 




5 




10 




15 


Leu 

val 

Met Arg 

Pro 

Leu 

Cys 

Ser 

Arg Leu Glu Glu 

Ser 

Gly 

Val 

Lys 




20 





25 



30 



Val 

Leu 

Asn 

Leu 

Thr 

Tyr 

Asn 

Thr 

Arg Asp Pro Asn 

Arg 

Asp 

Ala 

He 



35 





40 



45 




Phe 

Thr 
50 

Gin 

He 

Asp 

Glu 

Phe 
55 

He 

Ser Asn Glu 

Pro 
60 

Ser 

Ala 

Leu 

Val 

Cys 

His 

Ser 

Met 

Gly 

Gly 

Leu 

Val 

Ala Arg Ala 

Tyr 

Leu 

Glu 

Ala 

Asn 

65 





70 



75 





80 

Ser 

Ala 

Pro 

Ser 

His 
85 

His 

Val 

Glu 

Lys Val He 
90 

Thr 

Leu 

Gly 

Thr 
95 

Pro 

His 

Thr 

Gly 

Ser 
100 

His 

He 

Ala 

Glu 

Lys Met Gin 
105 

Gin 

Lys 

Gly 
110 

Phe 

Glu 

Leu 

Leu 

Leu 
115 

Lys 

Asn 

Ser 

Val 

Glu 
120 

Phe Leu Leu 

Ser 

Lys 
125 

Asn 

Gly 

Asp 

Trp 

Pro 
130 

Phe 

Lys 

Ala 

Lys 

Leu 
135 

Tyr 

Ser He Ala 

Gly 
140 

Asp 

Leu 

Pro 

He 

Gly 

Leu 

Met 

Pro 

Leu 

He 

Val 

Lys 

Gly Ser Arg 

Ser 

Asp 

Gly 

Thr 

Val 

145 





150 



155 





160 

Leu 

Leu 

Asp Glu 

Thr 

Lys 

Leu 

Lys 

Gly Met Ala Glu 

His 

Lys 

Val 

Phe 





165 




170 




175 


His 

Leu 

Ser 

His 

Thr 

Ser 

Met 

lie 

Tyr Ser Arg Gin 

Val 

Val 

Asn 

Tyr 




180 





185 



190 



He 

Leu 

Glu 
195 

Arg 

Leu 

Asn 

Glu 

Asp 
200 

lie 







(2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 


Met 

lie Lys 

Asn 

Phe 

Asp 

Arg 

Glu 

Asn 

Ser 

Ser 

Leu 

Val 

Leu 

Ser 

Gly 

1 



5 





10 





15 


Gly Gly Ala 

Leu 

Gly 

He 

Ala 

His 

Leu 

Gly Val 

Leu 

His 

Asp 

Leu 

Glu 



20 





25 





30 



Lys 

Gin Asn 
35 

He 

Val 

Pro 

Asn 

Glu 
40 

lie 

Val 

Gly 

Thr 

Ser 
45 

Met 

Gly 

Gly 

He 

lie Gly Ala 

Ser 

Met 

Ala 

He 

Gly Met 

Lys 

Glu 

Lys 

Glu 

He 

Leu 


50 




55 





60 





Glu 

Glu He 

Lys 

Asn 

Phe 

Ser 

Asn 

Val 

Phe 

Asn 

Trp 

He 

Lys 

Phe 

Ser 
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65 





70 

75 

80 

Phe 

Ser 

Gly 

Asn 

Ser 

Val Val 

Asp Asn Glu Lys He Ala 

Lys lie Phe 





85 


90 

95 

Asp 

Thr 

Leu 

Phe 

Lys 

Asp Arg 

Lys Met Thr Asp Thr Val 

He Pro Leu 




100 



105 

110 

Lys 

Leu 

lie 

Ala 

Thr 

Asn Leu 

His Asn Gly His Lys Lys Val Phe Thr 



115 




120 125 


Ala 

Ser 

Asp 

Asp 

Val 

Leu lie 

Lys Asp Ala He Leu Ser 

Thr Met Ala 


130 




135 

140 


lie 

Pro 

Gly 

Val 

Phe 

Glu Glu 

His He He Asp Gly Glu Thr Tyr Gly 

145 





150 

155 

160 

Asp 

Gly 

Phe 

Leu 

Cys 

Glu Asn 

Leu Gly Val Asn Glu Ala 

Thr Phe Asn 





165 


170 

175 

Asp 

Val 

Leu 

Ala 

Val 

Asp Val 

Met Gly Glu Asn Ser Phe 

Glu Lys Ala 




180 



185 

190 

Met 

Pro 

Asp 

Asn 

Phe 

Phe Lys 

Thr Ser Asn Val Leu Glu 

Met Phe Glu 



195 




200 205 


Lys 

Ser 

Met 

Arg 

Leu 

Phe He 

Tyr Asn Gin Thr Gin Thr 

His He Lys 


210 




215 

220 

Asn 

Ala 

Asn 

Lys 

Asn 

He Tyr 

Leu He Glu Pro Val Thr 

Lys Glu Tyr 

225 





230 

235 

240 

Lys 

Thr 

Phe 

Gin 

Phe 

His Lys 

His Lys Glu lie Arg Ala 

Leu Gly Leu 





245 


250 

255 

Gly 

Leu 

Leu 







(2) INFORMATION FOR SEQ ID NO:55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 301 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: 


Met 

Pro 

Leu 

His 

Pro 

Lys 

Val 

Lys Lys Leu Leu Ser 

Gin Leu 

Pro 

Pro 

1 




5 



10 


15 


Gin 

Asp 

Phe 

Ser 

Arg 

Asn 

Val 

Gin Asp Leu Arg Lys 

Ala Trp Asp 

Leu 




20 




25 

30 



Pro 

Phe 

Ser 

Gly 

Arg 

Arg 

Glu 

Thr Leu Lys Arg Val 

Glu Asp 

Leu 

Glu 



35 



Ala 


40 

45 



He 

Pro 

Thr 

Arg 

Asp 

Arg 

He Arg Ala Arg Val 

Tyr Thr 

Pro 

Ser 


50 





55 

60 




Ser 

Lys 

Glu 

Asn 

Leu 

Pro 

Val 

Leu Val Tyr Tyr His 

Gly Gly Gly 

Phe 

65 





70 


75 



80 

Val 

Phe 

Gly 

Ser 

Val 

Asp 

Ser 

Tyr Asp Gly Leu Ala 

Ser Leu 

He 

Ala 





85 



90 


95 


Lys 

Glu 

Ser 

Gly 

He 

Ala 

Val 

He Ser Val Glu Tyr Arg Leu 

Ala 

Pro 




100 




105 

110 



Glu 

His 

Lys 

Phe 

Pro 

Thr 

Ala 

Val Asn Asp Ser Trp Asp Ala 

Leu 

Leu 
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115 120 125 


Trp 

lie 

Ala 

Glu 

Asn 

Gly 

Gly 

Lys 

Leu 

Gly Leu Asp 

Thr 

Ser 

Arg 

Leu 

130 





135 





140 





Ala 

Val 

Ala 

Gly 

Asp 

Ser 

Ala 

Gly 

Gly 

Asn 

Leu 

Ser 

Ala 

Val 

Val 

Ser 

145 





150 





155 





160 

Leu 

Leu 

Asp 

Arg 

Asp 
165 

Gin 

Gly 

Lys 

Gly 

Leu 
170 

Val 

Ser 

Tyr 

Gin 

Val 
175 

Leu 

lie 

Tyr 

Pro 

Ala 

Val 

Asn 

Met 

Val 

Asp 

Asn 

Ser 

Pro 

Ser 

Val 

Arg 

Glu 



180 





185 





190 



Tyr 

Gly 

Glu 

Gly 

Tyr 

Phe 

Leu 

Thr 

Arg 

Ser 

Met 

Met 

Asn 

Trp 

Phe 

Gly 


195 





200 





205 




Thr 

Met 

Tyr 

Phe 

Ser 

Ser 

Gly 

Arg 

Glu 

Ala 

Val 

Ser 

Pro 

Tyr 

Ala 

Ser 


210 




215 





220 





Pro 

Ala 

Leu 

Ala 

Asp 

Leu 

His 

Asn 

Leu 

Pro 

Pro 

Ser 

Leu 

Val 

He 

Thr 

225 




230 





235 





240 

Ala 

Glu 

Tyr 

Asp 

Pro 

Leu 

Arg 

Asp 

Gin 

Gly Glu Thr 

Tyr 

Ser 

His 

Ser 




245 





250 





255 


Leu 

Asn 

Glu 

Ala 

Gly 

Asn 

Val 

Ser 

Thr 

Leu Val Arg 

Tyr Gin 

Gly Met 




260. 





265 





270 



He 

His 

Gly 

Phe 

Leu 

Ser 

Phe 

Tyr 

Glu 

Trp 

He 

Thr 

Ala Gly 

Lys 

Leu 



275 





280 





285 




Ala 

lie 

His 

His 

He 

Ala 

Gly 

Val 

Leu 

Arg 

Ser 

Val 

Leu 





290 295 300 


(2) INFORMATION FOR SEQ ID NO:56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 


val 

Ala 

Phe 

Phe 

Asp 

Met 

Pro 

Leu 

Glu 

Glu 

Leu 

Lys 

Lys 

Tyr 

Arg Pro 

1 




5 





10 





15 

Glu 

Arg 

Tyr 

Glu 

Glu 

Lys 

Asp 

Phe 

Asp 

Glu 

Phe Trp 

Arg Glu 

Thr Leu 




20 





25 





30 


Lys 

Glu 

Ser 

Glu 

Gly 

Phe 

Pro 

Leu 

Asp 

Pro 

val 

Phe 

Glu 

Lys 

Val Asp 


35 





40 





45 



Phe 

His 

Leu 

Lys 

Thr 

val 

Glu 

Thr 

Tyr Asp Val 

Thr 

Phe 

Ser 

Gly Tyr 


50 





55 





60 




Arg 

Gly 

Gin 

Arg 

He 

Lys 

Gly 

Trp 

Leu 

Leu 

Val 

Pro 

Lys 

Leu 

Ala Glu 

65 





70 





75 




80 

Glu 

Lys 

Leu 

Pro 

Cys 
85 

Val 

Val 

Gin 

Tyr 

He 
90 

Gly 

Tyr 

Asn 

Gly 

Gly Arg 
95 

Gly 

Phe 

Pro 

His 

Asp 

Trp 

Leu 

Phe 

Trp 

Pro 

Ser 

Met 

Gly Tyr 

He Cys 



100 





105 





110 


Phe 

Val 

Met 

Asp 

Thr 

Arg 

Gly 

Gin 

Gly 

Ser Gly Trp 

Met 

Lys 

Gly Asp 



115 





120 





125 



Thr 

Pro 

Asp 

Tyr 

Pro 

Glu 

Gly 

Pro 

Val 

Asp 

Pro 

Gin 

Tyr 

Pro 

Gly Phe 
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130 



135 



140 

Met 

Thr Arg Gly lie 

Leu Asp 

Pro Gly Thr Tyr Tyr Tyr Arg Arg Val 

145 



150 




155 160 

Phe 

Val Asp Ala Val 

Arg 

Ala 

Val 

Glu 

Ala Ala He- Ser Phe Pro Arg 



165 





170 175 

Val 

Asp Ser 

Arg Lys 

Val 

Val 

Val 

Ala 

Gly Gly Ser Gin Gly Gly Gly 



180 




185 

190 

lie 

Pro Leu 

Ala Val 

Ser 

Ala 

Leu 

Ser 

Asn Arg Val Lys Ala Leu Leu 


195 




200 


205 

Cys 

Asp Val 

Pro Phe 

Leu 

Cys 

His 

Phe Arg Arg Ala Val Gin Leu Val 


210 



215 



220 

Asp 

Thr His 

Pro Tyr 

Val 

Glu 

He 

Thr 

Asn Phe Leu Lys Thr His Arg 

225 



230 




235 240 

Asp 

Lys Glu 

Glu He 

Val 

Phe 

Arg 

Thr 

Leu Ser Tyr Phe Asp Gly Val 



245 





250 255 

Asn 

Phe Ala 

Ala Arg 

Ala 

Lys 

Val 

Pro 

Ala Leu Phe Ser Val Gly Leu 



260 




265 

270 

Met 

Asp Thr 

He Cys 

Pro 

Pro 

Ser 

Thr 

Val Phe Ala Ala Tyr Asn His 


275 




280 


285 

Tyr 

Ala Gly Pro Lys 

Glu 

He 

Arg 

He 

Tyr Pro Tyr Asn Asn His Glu 


290 



295 



300 

Gly 

Gly Gly Ser Phe 

Gin 

Ala 

lie 

Glu 

Gin Val Lys Phe Leu Lys Arg 

305 



310 




315 320 

Leu 

Phe Glu 

Glu Gly 







325 


(2) INFORMATION FOR SEQ ID NO:57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 


Met 

Arg 

Thr 

Leu 

Ser 

Phe 

Gly Pro Met Thr Thr Gly Gly Ser 

lie 

His 

1 




5 


10 

15 


Met 

Ala 

Thr 

Met 
20 

Asp 

Val 

Met Arg Gly Pro Gly Met Gin Arg 
25 30 

Leu 

Ser 

Gin 

Gly 

Ala 
35 

Arg 

Glu 

Ala 

Ala Asn His Pro Trp Ala Lys Arg 
40 45 

Leu 

Gly 

Arg 

Met 

Gly 

Tyr Ala Ala 

Lys Gly Ala Val Tyr Ala He He 

Gly 

Val 


50 





55 60 



Leu 

Ala 

Leu 

Lys 

Leu 

Ala 

Ala Gly Glu Gly Gly Arg Thr Thr 

Asp 

Ser 

65 





70 

75 


80 

His 

Gly 

Ala 

Val 

Asn 
85 

Thr 

Val Ala His Gly Pro Phe Gly Val 
90 

Ala 
95 

Leu 

Leu 

Ala 

Val 

Leu 
100 

Val 

Val 

Gly Leu Leu Gly Tyr Val Val Trp 
105 110 

Arg 

Phe 

Ala 

Gin 

Ala 
115 

Phe 

Val 

Asp 

Thr Glu Asp Lys Gly Ser Asp Ala 
120 125 

Lys 

Gly 
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He 

Ala 

Thr 

Arg 

Ala 

Met 

Tyr Phe Leu Ser 

Gly Cys He Tyr Ala Ser 


130 





135 

140 

Leu 

Ala 

Phe 

Phe 

Ala 

Ala 

Gin Ser Leu Val 

Gly Ala Ala His Gly Arg 

145 





150 


155 160 

Ser 

Lys 

Gly 

Thr 

Gin 

Gly 

Trp Thr Ala Thr 

Leu Met Glu Gin Pro Phe 





165 


170 

175 

Gly 

Arg 

val 

Leu 

Val 

Ala 

Leu Val Gly Leu 

Gly He Val Gly Phe Ala 




180 



185 

190 

Leu 

Lys 

Gin 

Phe 

His 

Thr 

Ala Trp Lys Ala 

Lys Phe Arg Glu Lys Leu 



195 




200 

205 

Thr 

Leu 

Thr 

Gly 

Leu 

Ala 

Ala Arg Lys Gin 

His His He Glu Arg Met 


210 





215 

220 

Cys 

Gin 

Phe 

Gly 

He 

Ala 

Ala Arg Gly Val 

Val Phe Ala Val He Gly 

225 





230 


235 240 

Gly 

Phe 

Leu 

Val 

Arg 

Ser 

Ala Val Asp Ala 

Asn Pro Gly Glu Ala Lys 





245 


250 

255 

Gly 

Leu 

Gly 

Glu 

Ala 

Leu 

Ala Val Val Ala 

Arg Gin Pro Ser Gly Asp 




260 



265 

270 

Val 

Leu 

Leu 

Gly 

Val 

Val 

Ala Ala Gly Leu 

Val Ala Tyr Ala Ala Tyr 



275 




280 

285 

Leu 

Phe 

Leu 

Gin 

Ala 

Arg 

Tyr Arg Glu Leu 



290 





295 



(2) INFORMATION FOR SEQ ID NO:58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 


Met 

Ser 

Lys 

Phe 

Ala 

He 

Leu 

Trp Ala 

Leu He Thr Ala Tyr Leu 

Pro 

1 




5 




10 15 


Glu 

Pro 

Val 

Met 

Lys 

Leu 

Val 

Tyr Leu 

Gly Arg Arg Glu Thr Leu 

Gly 




. 20 




25 

30 


Ala 

Arg 

Thr 

Leu 

Asp 

Val 

Lys 

Ala Gin 

Ala Val Gly Arg Leu Ala 

Asn 



35 





40 

45 


Ala 

Thr 

Arg 

Pro 

Val 

Gly 

val 

He Pro 

Thr Val Glu Glu Ser Arg 

Lys 


50 





55 


60 

Met 

Thr 

Asp 

Lys 

Ala 

Val 

Ser 

Leu Phe 

Asp Gin Pro Ala Pro Glu 

Leu 

65 





70 



75 

80 

Phe 

Arg 

Lys 

Lys 

Asp 

He 

Gin 

He Asp Gly Ala Glu Gly Pro He 

Asp 





85 




90 95 


Ala 

Arg 

He 

Tyr 

Ser 

Gly 

Pro 

Ala Lys 

His Arg Xaa Xaa Pro Xaa 

Leu 




100 




105 

110 


val 

Tyr 

Phe 

His 

Gly 

Gly 

Gly 

Trp Val 

Gin Gly Asn Leu Asp Ser 

His 



115 





120 

125 



128 
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Asp 

Gly 
130 

Val Cys 

Gly 

Lys 

Leu 
135 

Ala 

Lys 

Trp Ala Asn 
140 

Cys 

He 

Val He 


Ser 

Val 

Asp Tyr 

Arg 

Leu 

Ala 

Pro 

Glu 

His Lys Phe 

Pro 

Cys 

Ala Pro 

5 

145 




150 




155 



160 


Leu 

Asp 

Ala He 

Ala 
165 

Ala 

Tyr 

Lys 

Trp 

Val Arg Ala 
170 

Asn 

Ala 

Thr Asn 
175 


Leu 

Gly 

Gly Asp 
180 

Pro 

Glu 

Arg 

lie 

Gly 
185 

Val Gly Gly 

Asp 

Ser 
190 

Ala Gly 

10 

Gly 

Asn 

Leu Ala 
195 

Ala 

Val 

Val 

Cys 
200 

Gin 

Gin Thr Ala 

Met 
205 

Asn 

Gly Glu 


Arg 

Thr 
210 

Pro Asp 

Leu 

Gin 

Val 
215 

Leu 

He 

Tyr Pro Ala 
220 

Leu 

Asp 

Ala Arg 


Met 

He 

Ser Thr 

Ser 

Met 

Glu 

Glu 

Leu 

Arg Asp Ala 

Tyr 

lie 

Leu Pro 

15 

225 




230 




235 



240 


Lys 

Ser 

Arg Met 

Glu 
245 

Tyr 

Phe 

Leu 

Gly 

Leu Tyr Thr 
250 

Arg 

Gly 

Pro Asp 
255 


Asp 

He 

Glu Asp 
260 

Leu 

Arg 

Met 

Ser 

Pro 
265 

He Leu Arg 

Asp 

Thr 
270 

Val Ala 

20 

Asp 

Gin 

Pro Gin 

Ala 

Cys 

lie 

Val 

Thr 

Cys Gly Phe 

Asp 

Pro 

Ala Arg 



275 




280 



285 




Arg 

Arg 
290 

Glu His 

Leu 

Arg 

Arg 
295 

Thr 

Leu 

Asn Cys Arg 
300 

Gly 

Asp 

Arg Arg 


25 

(2) INFORMATION FOR SEQ ID NO:59: 

(i) SEQUENCE CHARACTERISTICS: 

30 (A) LENGTH: 308 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

35 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 

40 


50 


Val 

Ser 

He 

Arg 

Leu 

Arg 

Leu 

Leu 

Asn 

Trp 

Phe Leu 

Asn 

Thr 

Phe 

Glu 

1 




5 





10 




15 


Lys 

Pro 

Lys 

Leu 
20 

Ala 

Ala 

Ala 

Lys 

Thr 
25 

Pro 

Asp Asp 

Leu 

Arg 
30 

Lys 

Ser 

Phe 

Glu 

Leu 
35 

Lys 

Ala 

Arg 

Phe 

Leu 
40 

Phe 

Pro 

Ala Pro 

Arg 
45 

Lys 

Thr 

Arg 

Phe 

Ser 
50 

His 

Asp 

Val 

Leu 

Gin 
55 

Ser 

Gly 

He 

Gly Ser 
60 

Val 

Asn 

Ala 

Gin 

Trp 

Ala 

Lys 

Ser 

Lys 

Ser 

Ala 

Ser 

Asp 

Asp Arg Val 

He 

Leu 

Tyr 

Phe 

65 





70 





75 




80 

His 

Gly 

Gly 

Gly 

Tyr 
85 

Val 

Phe 

Gly 

Ser 

Pro 
90 

Lys Thr 

His 

Arg 

Ala 
95 

Met 

Leu 

Ala 

Arg 

Leu 
100 

Ser 

Ala 

Met 

Thr 

Gly 
105 

Leu 

Ser Ala 

Cys 

Leu 
110 

Pro 

Asp 

Tyr 

Arg 

Leu 
115 

Ala 

Pro 

Glu 

His 

Pro 
120 

Phe 

Pro 

Ala Ala 

He 
125 

Glu 

Asp 

Ala 

val 

Leu 
130 

Ser 

Tyr 

Lys 

Cys 

Leu 
135 

Leu 

Glu 

Arg 

Ala He 
140 

Glu 

Pro 

Gin 

Asn 


129 
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5 


10 


20 


He 

He 

Leu 

Gly 

Gly 

Asp 

145 





150 

Leu 

Ala 

Glu 

He 

Lys 
165 

Ala 

Ala 

Leu 

Ser 

Pro 
180 

Leu 

Val 

Lys 

Asn 

Ala 
195 

Gin 

Thr 

Asp 

Met 

Ala 
210 

Thr 

Leu 

Tyr 

Leu 

Ser 

Pro 

Leu 

Gin 

Ala 

Asp 

225 





230 

Ala 

Ser 

Asp 

Ser 

Glu 
24 5 

He 

His 

Leu 

Arg 

Ala 
260 

Gin 

Gly 

His 

Pro 

His 
275 

Val 

Trp 

His 

Gin 

Gly 
290 

Leu 

Arg 

Ala 

He 

Ser 

Asn 

Glu 

Ser 




305 


25 (2) INFORMATION FOR SEQ ID NO:60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
35 ( V ) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID 


40 

Met 

1 

Leu 

Thr 

Phe 

Asn 
5 

Val 

Leu 


Ala 

He 

Leu 

Met 
20 

Phe 

Leu 

Gly 


Pro 

Asp 

Leu 

Gin 

Lys 

Tyr 

Arg 

45 



35 






Pro 

Lys 
50 

Glu 

Asn 

Arg 

Lys 

Val 
55 


Ala 

Trp 

He 

Ser 

Gin 

Arg 

Pro 


65 





70 


50 

Gly 

Arg 

Gly 

He 

Ser 
85 

Gly 

Gin 


Arg 

Gin 

Asp 

Val 
100 

He 

Asp 

Leu 


Gly 

Thr 

Asn 
115 

Asp 

Val 

Ala 

Gin 

55 

Ser 

Leu 
130 

Ala 

Asn 

He 

Lys 

Ser 
135 


Ser 

Ala Gly Gly Gly Leu 

Val Leu Ala 

Leu 


155 


160 

Gin 

Ser Leu Pro Lys Pro 

Ala Gly Val 

Phe 


170 

175 


Asp 

Leu Ser Phe Ser Gly 

Leu Ser Phe 

Ser 


185 

190 


Val 

Met Leu Pro Ala Ser 

Arg Ala Ala 

Asp 


200 

205 


Asp 

Gly Ala Asp Ala Asp 

Asp Pro Arg 

Ala 

215 

220 



Phe 

Ser Gly Met Pro Pro 

Val Phe Leu 

Thr 


235 


240 

Leu 

Leu Asp Asp Cys Leu 

Arg Met Ala 

Asp 


250 

255 


Val 

Val Val Thr Asp Arg 

lie Val Glu 

Asn 


265 

270 


He 

Phe Gin Arg Leu Leu 

Pro Glu Ala 

Asp 


280 

285 


Ala 

Ala Trp He Lys Pro 

Leu Leu Ser 

Gly 

295 

300 




NO:60: 


Tyr 

Gly Met 

Met 

Lys 

Gin 

Lys 

Leu 

Ala 



10 





15 


Leu 

Ser 

Ala 

Ala 

Glu 

Ala 

Gin 

Asp 

Trp 


25 





30 



Ser 

Ala 

Asn 

Lys 

Glu 

Ala 

Lys 

Leu 

Leu 

40 





45 




Val 

Phe 

Met 

Gly Asn Ser 

lie 

Thr 

Glu 





60 





Glu 

Phe 

Phe 

Ser 

Glu 

Asn 

Gly 

Phe 

He 




75 





80 

Thr 

Thr 

Pro 

Gin 

Met 

Leu 

Leu 

Arg 

Phe 



90 





95 


Gin 

Pro 

Lys 

Ala 

Val 

Val 

He 

Leu 

Ala 


105 





110 



Asn 

Thr 

Gly 

Pro 

Met 

Thr 

He 

Glu 

Glu 

120 





125 




Met 

Val 

Glu 

Leu 

Ala 

Gin 

Ala 

Asn 

Gly 


140 


130 
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lie Thr Pro Val Leu Cys Thr Val Leu Pro Ala Asp Arg Phe Ser Trp 
145 150 155 160 

Arg Pro Glu Leu Thr Pro Ala Glu Thr He He Ala Leu Asn Gin Leu 
5 165 170 175 

He Lys Gin Tyr Ala Glu Ala Gin Gly Leu Ala Leu Val Asp Tyr His 

180 185 190 

Ala Ala Leu Thr Asn Lys Gly Gly Gly Leu Pro Val Lys Tyr Gly Glu 
195 200 205 

10 Asp Gly Val His Pro Asn Val Ala Gly Tyr Gin Val Met Glu Asn He 

210 215 220 

Val Leu Pro Val He Ser Ser Glu Leu Ala Lys Leu Lys 
225 230 235 

15 (2) INFORMATION FOR SEQ ID NO:61 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
25 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: 


30 

Met 

1 

Ala 

Phe 

Phe 

Asp 
5 

Leu 

Pro 

Leu 

Glu 

Glu Leu 
10 

Lys 

Lys 

Tyr 

Arg 
15 

Pro 


Glu 

Arg 

Tyr 

Glu 
20 

Glu 

Lys 

Asp 

Phe 

Asp 
25 

Glu Phe 

Trp 

Glu 

Glu 
30 

Thr 

Leu 


Ala 

Glu 

Ser 

Glu 

Lys 

Phe 

Pro 

Leu 

Asp 

Pro Val 

Phe 

Glu 

Arg 

Met 

Glu 

35 



35 





40 




45 




Ser 

His 
50 

Leu 

Lys 

Thr 

Val 

Glu 
55 

Ala 

Tyr 

Asp Val 

Thr 
60 

Phe 

Ser 

Gly 

Tyr 


Arg 

Gly 

Gin 

Arg 

He 

Lys 

Gly 

Trp 

Leu 

Leu Val 

Pro 

Lys 

Leu 

Glu 

Glu 


65 





70 




75 





80 


Glu 

Lys 

Leu 

Pro 

Cys 

Val 

Val 

Gin 

Tyr 

He Gly 

Tyr 

Asn 

Gly 

Gly 

Arg 

40 





85 





90 




95 



Gly 

Phe 

Pro 

His 
100 

Asp 

Trp 

Leu 

Phe 

Trp 
105 

Pro Ser 

Met 

Gly 

Tyr 
110 

He 

Cys 


Phe 

Val 

Met 
115 

Asp 

Thr 

Arg 

Gly 

Gin 
120 

Gly 

Ser Gly 

Trp 

Leu 
125 

Lys 

Gly 

Asp 

45 

Thr 

Pro 
130 

Asp 

Tyr 

Pro 

Glu 

Gly 
135 

Pro 

Val 

Asp Pro 

Gin 
140 

Tyr 

Pro 

Gly 

Phe 


Met 

Thr 

Arg 

Gly 

lie 

Leu 

Asp 

Pro 

Arg 

Thr Tyr 

Tyr 

Tyr 

Arg 

Arg 

Val 


145 





150 




155 





160 


Phe 

Thr 

Asp 

Ala 

Val 

Arg 

Ala 

Val 

Glu 

Ala Ala 

Ala 

Ser 

Phe 

Pro 

Gin 

50 





165 





170 




175 


Val 

Asp 

Gin 

Glu 
180 

Arg 

He 

Val 

lie 

Ala 
185 

Gly Gly 

Ser 

Gin 

Gly 
190 

Gly 

Gly 


He 

Ala 

Leu 
195 

Ala 

Val 

Ser 

Ala 

Leu 
200 

Ser 

Lys Lys 

Ala 

Lys 
205 

Ala 

Leu 

Leu 


Cys 

Asp 

Val 

Pro 

Phe 

Leu 

Cys 

His 

Phe Arg Arg 

Ala 

Val 

Gin 

Leu 

Val 

55 


210 





215 




220 






Asp 

Thr 

His 

Pro 

Tyr 

Ala 

Glu 

He 

Thr 

Asn Phe 

Leu 

Lys 

Thr 

His 

Arg 


131 
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240 

Asp 

Lys 

Glu 

Glu 

lie 

OA C\ 

val 

Phe Arg Thr 

Leu 

Ser 

Tyr 

Phe 

Asp Gly -Val 
255 

Asn 

Phe 

Ala 

Ala 

Arg 

Ala 

Lys lie Pro 

Ala 

Leu 

Phe 

Ser Val Gly Leu 




260 



265 





270 

Met 

Asp 

Asn 

lie 

Cys 

Pro 

Pro Ser Thr 

Val 

Phe 

Ala 

Ala 

Tyr Asn Tyr 



275 




280 




285 

Tyr 

Ala 
290 

Gly 

Pro 

Lys 

Glu 

lie Arg lie 
295 

Tyr 

Pro 

Tyr 
300 

Asn 

Asn His Glu 

Gly 

Gly 

Gly 

Ser 

Phe 

Gin 

Ala Val Glu 

Gin 

Val 

Lys 

Phe 

Leu Lys Lys 

305 





310 



315 



320 

Leu 

Phe 

Glu 

Lys 

Gly 
325 









(2) INFORMATION FOR SEQ ID NO:62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 


Leu 

Lys 

Tyr 

Phe 

Lys 

Ala 

Arg 

Leu 

Ala 

Gly lie Thr Leu Leu Gly Leu 

1 




5 





10 15 

Leu 

Ala 

Cys 

Thr 

Ser 

Ala 

Ser 

Ala 

Gin 

Thr Glu Pro He Val Phe Val 




20 





25 

30 

His 

Gly 

Tyr 

Ser 

Gly 

Ser 

Ala 

Ser 

Asn 

Trp Asp Thr Met Leu Gly Arg 



35 





40 


45 

Phe 

Arg 

Ser 

Asn 

Gly 

Tyr 

Ala 

Ser 

Gly Ser Leu Tyr Thr Phe Asn Tyr 


50 





55 



60 

Asn 

Ser 

Leu 

Val 

Ser 

Ser 

Asn 

Arg 

Thr Ser Ala Ser Glu Leu Arg Ser 

65 





70 




75 80 

Phe 

Val 

Asn 

Thr 

Val 

Arg 

Ser 

Arg 

His 

Gly Asn Ala Arg He Ala Leu 





85 





90 95 

Val 

Ala 

His 

Ser 

Asn 

Gly 

Gly 

Leu 

Val 

Ser Arg Trp Tyr Arg Ala Glu 




100 





105 

110 

Leu 

Gly 

Gly 

Glu 

Thr 

Ala 

Thr 

Arg 

Arg 

Phe Val Thr Leu Gly Thr Pro 



115 





120 


125 

His 

Arg 

Gly 

Thr 

Thr 

Trp 

Ala 

Tyr 

Ala 

Cys Tyr Ser Pro Ala Cys Phe 


130 





135 



140 

Glu 

Met 

Arg 

Pro 

Gly 

Ser 

Ser 

Leu 

Leu 

Thr Thr Leu Gly Ser Arg Ala 

145 





150 




155 160 

Cys 

Asp 

Arg 

Ser 

Leu 

Trp 

Ser 

Asn 

Thr Asp Gly lie lie Leu Pro Ala 





165 





170 175 

Ser 

Ser 

Ala 

Gin 

Cys 

Gly 

Val 

Ser 

Thr 

Arg Thr Ala Asp Val Ser His 




180 





185 

190 

Leu 

Asp 

Leu 

Leu 

Thr 

Asp 

Ser 

Arg 

Val 

Tyr Thr Gin Leu Arg Thr Gin 



195 





200 


205 


Leu Gin 


210 


132 
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Claims 

1. A polynucleotide comprising a member selected from the group consisting of: 

a) a polynucleotide comprising a nucleic acid sequence set forth in SEQ ID NO:23; 

b) a polynucleotide which encodes an enzyme comprising an amino acid sequence set forth in SEQ ID NO:33; 

c) a polynucleotide comprising at least 15 consecutive bases of a polynucleotide of (a) or (b); 

d) a polynucleotide having at least 70%, 80%, 90%, 95% or 97% identity to the polynucleotide of (a) or (b); 
and encoding an enzyme having esterase activity; and 

e) a polynucleotide which is complementary to a polynucleotide of any one of (a) to (d). 

2. A polynucleotide hybridizing under stringent conditions to: 

a) a polynucleotide comprising a nucleic acid sequence set forth in SEQ ID NO:23; or 

b) a polynucleotide which is complementary to a polynucleotide of (a); 

and encoding an enzyme having esterase activity. 

3. The polynucleotide of claim 1 or 2, wherein the polynucleotide is DNA or RNA. 

4. A vector comprising the polynucleotide of any one of claims 1 to 3. 

5. A host cell comprising the vector of claim 4. 

6. A process for producing a polypeptide comprising expressing from the host cell of claim 5 a polypeptide encoded 
by said polynucleotide. 

7. A process for producing a cell comprising transforming or transfecting the cell with the vector of claim 4 such that 
the cell expresses the polypeptide encoded by the polynucleotide contained in the vector. 

8. An enzyme comprising a member selected from the group consisting of: 

(a) an enzyme encoded by the polynucleotide of any one of claims 1 (a), (b) or (d); 

(b) an enzyme comprising an amino acid sequence which is at least 70%, 90% or 95% identical to an amino 
acid sequence set forth in SEQ ID NO: 33 ; 

(c) an enzyme which comprises at least 30 amino acids of the enzyme o (a) or (b); and 

(d) an enzyme obtainable by the process of claim 6. 

9. An antibody specifically recognizing the enzyme of claim 8. 

10. A method for transferring an amino group from an amino acid to an a-keto acid comprising contacting an amino 
acid in the presence of an a-keto acid with the enzyme of claim 8. 

11. A method for the isolation of the enzyme of claim 8 comprising using the antibody of claim 9. 

12. Use of the enzyme of claim 8: 

(a) as ripening starter for cheese in the dairy industry; 

(b) for lignin removal in the pulp and paper industry; 

(c) for the synthesis of carbohydrate derivatives; and/or 

(d) for the conversion of lignocellulosic wastes to fermentable sugars. 

13. Use of the polynucleotide of claim 1 or 2 for the identification of similar sequences in other organisms. 


Patentanspruche 

1. Polynucleotid, umfassend ein Mitglied ausgewahlt aus der Gruppe bestehend aus: 
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(a) einem Polynucleotid, umfassend eine Nucleinsauresequenz wie in SEQ ID NR:23 dargestellt; 

(b) einem Polynucleotid, das ein Enzym codiert, umfassend eine Aminosauresequenz wie in SEQ ID NR:33 
dargestellt; 

(c) einem Polynucleotid, umfassend mindestens 15 aufeinanderfolgende Basen eines Polynucleotids aus (a) 
5 oder (b); 

(d) einem Polynucleotid, das mindestens 70%, 80%, 90%, 95% oder 97% Identitat mit dem Polynucleotid aus 
(a) oder (b) hat; und ein Enzym mit Esterase-Aktivitat codiert; und 

(e) einem Polynucleotid, das zu einem Polynucleotid aus (a) bis (d) komplementar ist. 

10 2. Polynucleotid, das unter stringenten Bedingungen an: 

(a) ein Polynucleotid, umfassend eine Nucleinsauresequenz wie in SEQ ID NR:23 dargestellt; oder 

(b) ein Polynucleotid, das zu einem Polynucleotid aus (a) komplementar ist; hybridisiert und ein Enzym mit 
Esterase-Aktivitat codiert. 

15 

3. Polynucleotid nach Anspruch 1 oder 2, wobei das Polynucleotid DNA oder RNA ist. 

4. Vektor, umfassend das Polynucleotid nach einem der Anspruche 1 bis 3. 

20 5. Wirtszelle, umfassend den Vektor nach Anspruch 4. 

6. Verfahren zur Herstellung eines Polypeptids, umfassend das Exprimieren eines Polypeptids, das von dem Po- 
lynucleotid codiert wird, von der Wirtszelle nach Anspruch 5. 

25 7. Verfahren zur Herstellung einer Zelle, umfassend das Transformieren oder Transfizieren der Zelle mit dem Vektor 
nach Anspruch 4, so dass die Zelle das Polypeptid exprimiert, das von dem im Vektor enthaltenen Polynucleotid 
codiert wird. 

8. Enzym, umfassend ein Mitglied ausgewahlt aus der Gruppe bestehend aus: 

30 

(a) einem Enzym, das von dem Polynucleotid nach einem der Anspruche 1 (a), (b) oder (d) codiert wird; 

(b) einem Enzym, umfassend eine Aminosauresequenz, die zu mindestens 70%, 90% oder 95% identisch ist 
zu einer Aminosauresequenz wie in SEQ ID NR:33 dargestellt; 

(c) einem Enzym, das mindestens 30 Aminosauren des Enzyms aus (a) Oder (b) umfasst; und 
35 (d) einem Enzym, erhaltlich durch das Verfahren nach Anspruch 6. 

9. Antikorper, der spezifisch das Enzym nach Anspruch 8 erkennt. 

10. Verfahren zum Transferieren einer Aminogruppe von einer Aminosaure zu einer a-Ketosaure, umfassend das 
to Inkontaktbringen einer Aminosaure mit dem Enzym nach Anspruch 8 in Anwesenheit einer a-Ketosaure. 

11. Verfahren zum Isolieren des Enzyms nach Anspruch 8, umfassend die Verwendung des Antikorpers nach An- 
spruch 9. 

45 12. Verwendung des Enzyms nach Anspruch 8: 

(a) als Reifestarter fur Kase in der Milchindustrie; 

(b) zur Ligninentfernung in der Ze II stoff Industrie; 

(c) zur Synthese von Kohlenhydratderivaten; und/oder 

so (d) zur Umwandlung von Lignozelluloseabfallen zu fermentierbaren Zuckern. 

13. Verwendung des Polynucleotids nach Anspruch 1 oder 2 zur Identifizierung von a hn lichen Sequenzen in anderen 
Organismen. 

55 

Revendications 

1. Polynucleotide comprenant un membre choisi dans le groupe constitue de : 
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a) un polynucleotide comprenant une sequence d'acide nucieique representee dans SEQ ID NO : 23 ; 

b) un polynucleotide qui code une enzyme comprenant une sequence d'acides amines representee dans SEQ 
ID NO : 33 ; 

c) un polynucleotide comprenant au moins 15 bases cons6cutives d'un polynucleotide de a) ou b) ; 

d) un polynucleotide ayant au moins 70%, 80%, 90%, 95% ou 97% d'identite avec le polynucleotide de a) ou 
b) ; et codant une enzyme ayant une activite esterase ; et 

e) un polynucleotide compiementaire de I'un quelconque des polynucleotides de a) a d). 

2. Polynucleotide qui hybride dans des conditions stringentes a : 

a) un polynucleotide comprenant une sequence d'acide nucieique representee dans SEQ ID NO : 23 ; ou 

b) un polynucleotide compiementaire d'un polynucleotide de a) ; 

et codant une enzyme ayant une activite esterase. 

3. Polynucleotide selon la revendication 1 ou 2, dans lequel le polynucleotide est de r ADN ou de TARN. 

4. Vecteur comprenant le polynucleotide selon Tune quelconque des revindications 1 a 3. 

20 5. Cellule note comprenant le vecteur selon la revendication 4. 

6. Procede de production d'un polypeptide comprenant I'expression, a partir de la cellule hote selon la revendication 
5, d'un polypeptide code par ledit polynucleotide. 

25 7. Procede de production d'une cellule comprenant la transformation ou la transfection de la cellule avec le vecteur 
selon la revendication 4 de sorte que la cellule exprime le polypeptide code par le polynucleotide contenu dans le 
vecteur. 


10 


15 


30 


8. Enzyme comprenant un membre choisi dans le groupe constitue de : 


a) une enzyme codee par le polynucleotide selon Tune quelconque des revendications 1(a), (b) ou (d) ; 

b) une enzyme comprenant une sequence d'acides amines identique a au moins 70%, 90% ou 95% a une 
sequence d'acides amines representee dans SEQ ID NO : 33 ; 

c) une enzyme comprenant au moins 30 acides amines de I'enzyme de (a) ou (b) ; et 
35 d) une enzyme susceptible d'etre obtenue par le procede de la revendication 6. 

9. Anticorps reconnaissant specifiquement I'enzyme selon la revendication 8. 

10. Methode pour transferer un groupe amino d'un acide amine a un a-c6to acide, comprenant la mise en contact 
40 d'un acide amine en presence d'un a-ceto acide avec I'enzyme selon la revendication 8. 

11. Methode pour isoler I'enzyme selon la revendication 8, comprenant ['utilisation de I'anticorps selon la revendication 
9. 

45 12. Utilisation de I'enzyme selon la revendication 8 : 

a) comme inducteur de la maturation du fromage dans I'industrie laitiere ; 

b) pour eliminer la lignine dans I'industrie de la pate et du papier ; 

c) pour la synthese de derives d'hydrates de carbone ; et/ou 

50 d) pour la conversion de dechets ligno-cellulosiques en sucres fermentables. 

13. Utilisation du polynucleotide de la revendication 1 ou 2 pour ('identification de sequences similaires dans d'autres 
organismes. 

55 
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PX0UB8 1 

Stapoylotberaru* aariaoa * P1-12LC 

ATO TCT TTA AAC AAG CAC TCT TGG ATG OAT ATG ATA ATA TXT ATT CTC AGC TTT TCT TTC 60 

Met Ser Leu Aen Lys His Ser Trp Het Asp Met He He Phe lie Leu Sex Phe Ser Phe 20 

CCA TTA ACA ATG ATC CCA TTA OCT ATC TCT ATO TCG TCA TOG TTT AAT ATA TGG AAT AAT 120 

Pro Leu Ttor wet He Ala Leu Ala He Ser Met Ser Ser Trp Phe Asn He Trp Asn Asn 40 

GCA TtA AGC GAT CTA GOA CAT GCT GTT AAA AGC ACT GTT GCT CCA ATA TTC AAT CTA GGT 1B0 

Ala Leu Ser Asp Leu Gly His Ala Val Lys Ser Ser Val Ala Pro He Phe Aao Leu Gly $0 

CTT GCA ATT GOT GOO ATA CTA ATT GTT ATA GTT GGT TTA AGA AAT CTT TAT TOG TOO ACT 240 

Leu Ala He Gly Gly lie Leu lie Val He Val Gly Leu Arg Abu Leu Tyr Ser Trp Ser BO 

AGA GTT AAA GGA TCT TTA ATC ATA TCC ATG GOT OTA TTT CTT AAC TTA ATA GGG GTT TTC 300 

Arg Val Lya Oly Ser Leu He He Ser Met Gly Val Phe Leu Asn Leu He Gly Val Phe ioo 

GAC GAA GTA TAT GGT TGG ATA CAT TTC CTA CTC TCA GTA TTO TTT TTC TTA TCA ATA ATA 360 

Asp Glu Val Tyr Gly Trp He His Phe Leu Val Ser Val Leu Phe Phe Leu Ser He He 120 

GCA TAT TTC ATA GCT ATA TCA ATA CTT GAC AAA TCA TGG ATA GCT OTT CTA CTA ATA ATA 420 

Ala Tyr Phe He Ala He Ser He Leu Asp Lye Ser Trp He Ala Val Leu Leu He He 140 

GGT CAT ATT GCA ATG TGG TAT CTA CAC TTT GCT TCA GAG ATT COS AGA GGT GOG GCT ATT 480 

Gly His He Ala Met Trp Tyr Leu Hia Phe Ala Ser Glu He pro Arg Gly Ala Ala He 160 

CCC GAG TTA TTA OCG GTA TTC TOG TTT TTA CCA TTC TAT ATA AGA GAC TAT TTT AAA TCA 540 

. Pro Glu Leu Leu Ala Val Phe Ser Phe Leu Pro Phe Tyr He Arg Asp Tyr Phe Lys Ser 180 

TAC ACT AAA CGA TAG SS5 

Tyr Thr Lys Arg 185 
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FX CQStB 2 
Pyrodictiua - TA011-17LC 

ATG MA CTC CTT GAG CCC ACA AAT ACC TCC TAC ACQ CPO TTA CAO GAT TTA OCA TTG CAT €0 
Met Lyo Leu lieu Glu Pro Thr Aon Thr Ser Tyr Thr Leu Leu Gla Asp Leu Ala Leu His 20 

TTT CCA TTT TAC TOO CTT CTO CCC GTO TAT ACG TOO TTA CCC GOT CTC CTA GTC COG GGC 120 
Phe Ala Phe Tyr Trp Phe Leu Ala Val Tyr Thr Trp Leu Pro Gly Val Leu val Arg Qly . 40 

CTA GCT GTO GAC Ad GGG GTO OCT CCG GTO OCT GGG CTC GGC CGO CGC GOT AAG AGO CTG 100 
Val Ala Val Asp Thr Gly Val Ala Arg val pro Gly Leu Gly Arg Arg Gly Lya Arg Leu €0 

CTC CTG GCC GCT GTG GCT GTC TTO GCO CTT GTT GTO TCC GIT GTT GTC CCG GCT TAT GTG 340 
Leu Leu Ala Ala Val Ala Val Leu Ala Leu Val Val Sex Val Val Val Pro Ala Tyr Val eo 

003 TAT ACT ACT CTO CAC CCG GAG AGC TOT CGO CCC GTT GCO CCG GAG GGG CTC ACC TAC 300 
Ala Tyr Ser Ser Leu His Pro Glu Ser eye Arg Pro Val Ala Pro Glu Gly Leu Thr Tyr 100 

AAA GAG TTC AGC GTO ACC GCG GAG GAT GGC TTG GTG GTT CGO GGC TGG GTG CTG GGC CCC 360 
Lya Glu Phe Sex Val Thr Ala Glu Aop Gly Leu val Val Arg Gly Trp Val Leu Gly Pro 120 

GC GCT GGG GGC AAC CCG GTO TTC GTT TTG ATG CAC GGG TAT ACT GGG TOC CGC TOG O05 420 
Gly Ala Gly Gly Asn Pro Val Phe Val Leu Met Mia Gly Tyr Thr Gly Cys Arg Ser Ala 140 

CCC TAC ATG GCT GTG CTO GCC GGG GAG CTC GTG GAS TGG GGG TAC CCG GTG GTT GTO TTC 480 
Pro Tyr Mat Ala Val Leu Ala Arg Glu Leu Val Glu Trp oly Tyr Pro Val Val val Phe 160 

GAC TTC CGO GGC CAC GGG GAG AGC GGG GOC TCG ACO ACQ ATT GGG CCC CGG GAG GTG CTG 540 
Asp Phe Arg Gly His Gly Glu Sex Gly Oly Ser Thr Thr lie Gly Pro Arg Glu Val Leu 180 

GAT GCC CGG GCT GTG GTO GGC TAT GTC TCO GAG CGG TTC CCC GGC CGC CGO ATA ATA TTG 600 
Asp Ala Arg Ala Val Val Oly Tyr Val Ser Glu Arg Phe Pro Gly Arg Arg lie He Leu 200 

GTG GGG TTC ACT ATG GGC GGC GCT GTA GCO ATC GTO GAG GOT GCT GGG GAC CCG CGG GTC €£0 
Val Gly Phe Ser Met Oly Gly Ala Val Ala He Val Glu Gly Ala Gly Aop Pro Arg Val 220 

TAC GCG GTG GCT GCT GAT AGC COO TAC TAT AGO CTC CGG GAC GTC ATA CCC CGG TGG CTG 720 
Tyr Ala Val Ala Ala Asp Ser Pro Tyr Tyr Arg Leu Arg Asp Val lie Pro Arg Trp Leu 240 

GAG TAC AAG ACG CCG CTG CCG GGC TGG GTO GOT GTO CTO GCC GOO TTC TAC GGG AGO CTG 780 
Glu Tyr Lys Thr Pro Leu Pro Gly Trp Val Gly Val Leu Ala Gly Phe Tyr Gly Arg Leu 260 

ATG GCG GGC GTT GAC CTC GGC TTC GGC CCC GCT GGG GTG GAG CGC GTG GAT AAG CCG TTG 840 . 
Met: Ala Gly val Asp Leu Gly Phe Gly Pro Ala Oly Val Glu Arg Val Asp Lya Pro Leu 280 . 

CTG GTG GTG TAT GGG CCC CGG GAC CCG CTG GTG ACG CGG GAC GAG GCG AGO AGC CTO GCG 900 
Leu val Val Tyr Gly Pro Arg Asp Pro Leu Val Thr Arg Asp Glu Ala Arg Ser leu Ala 300 

TCC CGT AGC COO TGT GGC COT CTC GTC GAG GTT CCT GGG GCT GGC CAC GTO GAG GCC GTC 960 
Ser Arg Ser Pro Cys Gly Arg Leu Val Glu val Pro Gly Ala Gly Bis Val Glu Ala Val 320 \ 

GAT GTG CTC GGG CCG GGC CGC TAC GGA GAC ATQ CTG ATA GAG CTG GCO CAC GAG GAG TGC 1020 
Asp Val Leu Gly Pro Gly Arg Tyr Ala Asp Het Leu Zle Glu Leu Ala Bis Glu Glu Cys 340 

CCT CCG GGG GCC GOT GGC TGA 1041 
Pro Pro Gly Ala Gly Gly 3«6 


137 


EP 0 880 590 B1 


FXGU&B 3 

Archaeoglobus Venl£lcus SH 

ATG CCA TAT OTT AGO AAT GGT GGT GTA AAT ATC TAT TAT GAA CTG GTG GAT GGA CCT QAO 60 

Hot Pro Tyx Vol Arg Asn Oly Oly Val Aon lie Tyr Tyr Glu Leu Val Asp Oly pro Olu 20 

CCA CCA ATT GTC TTT GTT CAC GOA TGG ACA GCA AAT ATO AAT TTT'TGG AAA GAG CAA AGA 120 

Pro Pro lie Val Phe Val His Gly Trp Thr Ala Asn Met Asn Phe Trp Lys Glu Ola Arg 40 

CGT TAT TTT GCA GGC AGO AAT ATG ATO TTO TTT GTC GAT AAC AGA GGT CAT GGC AGO TCC ISO 

Arg Tyr Phe Ala Oly Arg Aon Met Met Leu Phe val Asp Asn Arg Gly His Gly Arg Ser 60 

GAT AAG CCA CTT GGA TAC GAT TTC TAC AGA TTT GAO AAC TTC ATT TCA GAT TTA OAT GCG 240 

Asp Lys Pro Leu Gly Tyr Asp Phe Tyr Arg Phe Glu Asn Phe Ho Ser Asp Leu Asp Ala 60 

GTT GTT AGO QAO ACT GGA GTG GAG AAA TTT GTT CTC GTC GGA CAT TCA TTC GGA ACA ATG 300 

val Val Arg Glu Thr Gly Val Glu Lys Phe Val Leu Val Gly His Ser Phe Gly Thr Met lbo 

ATC TCT ATG AAG TAC TGT TCG GAG TAT CGG AAT CGG GTT CTT GCT CtA ATC CTC ATA GGT 360 

He Ser Met Lys Tyr Cys Ser Glu Tyr Arg Asn Arg Val Leu Ala Leu He Leu tie Oly 120 

GOT GCG AGC AGA ATA AAG CTT CTA CAC AGA ATT GGA TAT CCT TTA GCA AAG ATT CTT GCA 420 

Gly Gly Ser Arg He Lys Leu Leu His Arg He Gly Tyr Pro Leu Ala Lys He Leu Ala 14 0 

TCC ATT GCA TAC AAG AAG TCT TCA AGA TTO GTC OCA GAT CTT TCC TTT OGC AAA AAT GCT 480 

Ser He Ala Tyr Lys Lys ser Ser Arg Leu Val Ala Aap Leu See. Phe Gly Lys Asn Ala 160 

GGT GAA CTT AAA GAO TOO GGA TGG AAA CAG GCA ATG GAT TAT ACA CCC TCC TAC GTG GCA 540 

Gly Glu Leu Lya Glu Trp Gly Trp Lys Gin Ala Met Asp Tyr Thr Pro Ser Tyr Val Ala 1B0 

ATO TAC AOG TAC AGA ACT CTA ACG AAA GTG AAT CTT OAA AAT ATC TTO GAG AAA AXA QAC 600 

Met Tyr Thr Tyr Arg Thr Leu Thr Lys val Asn Leu Glu Asn He Leu Glu Lys He Asp 200 

TGT CCA ACA CTO ATT ATC GTT GGA GAA GAG GAT GCA CTA TTO CCC GTT AGC AAA TCA OTT 660 

Cys Pro Thr Leu He He Val Oly Glu Glu Asp Ala Leu Leu Pro Val Ser Lys Sex Val 220 

GAG CTC AGC AGG AGO AXA GAA AAC TCA AAG CTT GTG ATC ATC CCA AAC TOG GOG CAT TCC 720 

Glu Leu Ser Arg Arg He Glu Asn Ser Lys Leu Val He He Pro Asn Ser Gly His Cys 240 

• 

OTA ATG CTT GAG ACT CCA AST GAG GTT AAT AGA GCA ATG CAC GAA TTC ATT TCT TCA GCA 760 

Val Met Leu Glu Ser Pro Ser Glu Val Asn Arg Ala Net Asp Qlu Pha He Ser Ser Ala 260 

CAG TTC TAA 789 

Gin Phe 263. 
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FXOCRB 4 

Aguifex pyrophtlUB - 28LC 

TTG AGA TTO AGG AAA TTT GAA GAG ATA AAC CTC GTT CTT TOG GGA GGA GCT GCA AAG GGC 60 

Leu Arg Leu Arg Lya Phe Glu Glu He Asa Leu Val Leu Ser Gly Gly Ala Ala Lys Gly 20 

ATA GCC CAC ATA GOT CTT TTO AAA GCT ATA AAC GAG CTC GGT ATA AGG GTO AGG GCT TTA 120 
He Ala Has He Gly Val Leu Lye Ala lie Aan Glu Leu Gly He Arg Val Arg Ala Leu . ' 40 

AGC GGG GTG AGC GCC GGG GCA ATC GTT TOG GTC TTT TAT GCC TCA GGC TAC TCC CCT GAA 180 

Ser Gly Val Ser Ala Gly Ala He Val Ser Val Phe Tyr Ala Ser Gly Tyr Ser Pro Glu 60 

GGG ATG TTC AGC CTT CTO AAG AGG GTA AAC TGG CTG AAG CTG TTT AAG TTC AAG CCA CCT 240 

Gly Met Phe Ser Leu Leu Lys Arg Val Aan Trp Leu Lys Leu Phe Lys Phe Lya Pro Pro 80 

CTO AAG GGA TTG ATA GGG TGG GAG AAG GCT ATA ASA TTC CTT GAG GAA GTT CTC CCT TAC 300 

Leu Lys Gly Leu He Gly Trp Glu Lys Ala He Arg Phe Leu Glu Glu Val Leu Pro Tyr 100 

AGG AGA ATA GAA AAA CTT GAG ATA CCD ACQ TAT ATA TGC GCO ACQ GAT TTA TAC TCG GGA 360 

Arg Arg He Olu Lys Leu Olu He Pro Thr Tyr lie eye Ala Tbr Asp Leu Tyr Ser Gly 120 

AGO CCT CTA TAC CTC TCG GAA GGG ACT TTA ATC CCC GCA CTT CTC GGC AGC TOT GO* ATT 420 

Arg Ala Leu Tyr Leu Ser Glu Gly Ser Leu lie Pro Ala Leu Leu Gly Ser Cya Ala He 140 

CCC GGC ATA TTT GAA CCC GTT GAG TAT AAG AAT TAC TTO CTC GTT GAC GGA GGT ATA GTT 460 

Pro Gly He Phe Glu Pro Val Glu Tyr Lys Aan Tyr Leu Leu val Asp Gly Gly He val 160 

AAC AAC CTT CCC GTT GAG CCC TTT CAG GAA AGC GGT ATT CCC ACC GTT TGC GTT OAT GTC 540 

Asn Asn Leu Pro Val Glu Pro Phe Gin Glu Ser Gly He Pro Thr val Cya Val Asp Val 180 

CTT CCC ATA GAG COG GAA AAG GAT ATA AAG AAC ATT CTT CAC ATC CTT TTG AGG AGC TTC 600 

Leu Pro He Glu Pro Glu Lys Asp He Lye Asn He Leu His Ha Leu Leu Arg Ser Phe 200 

TTT CTT GCO GTC CGC TCA AAC TCC GAA AAG AGA AAG SAG TTT TGT GAC CTC GTT ATA GTT 660 

Phe Leu Ala Val Arg ser Aan Ser Glu Lys Arg Lys Glu Phe Cys Asp Leu Val Xle Val 220 

CCT GAG CTT GAG GAG TTC ACA CCC CTT CAT GTT AGA AAA GCO GAC CAA ATA ATG GAG AGO 720 

Pro Glu Leu Glu Glu Phe Thr Pro Leu Asp Val Arg Lys Ala Asp Oln He Met. Glu Arg 240 

GGA TAC ATA AAG GCC TTA GAG GTA CTT TCT GAA TAG 756 

Gly Tyr He Lys Ala Leu Glu Val Leu Ser Glu 2S2 
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FIGQRB 3 
MHTL-29L. 


ATG ITT AAT ATC AAT GTC TTT OTT AAT ATA TCT TGO CTG TAT TXT TCA GOT ATA OTT ATG 60 

Met Phe Asn He Asn Val Phe Val Aan lie Ser Trp Leu Tyr Phe ser Gly He Val Ket 20 

AAO ACT GTG GAA GftO TAT OCO CTA CTT GAA. ACA GGC CIA AQA GTG TTT. TAT CGO TCT CTA • 120 

Lys Thr Val Olu Olu Tyr Ala Leu Leu Olu Thr cly val Arg Val Phe Tyr Arg Cys Val 40 

ATC CCO GAG AAA GCT TTT AAC ACT TTQ ATA ATA GGT TCA CAC OOA TTO CGO GCG CAC AGT 100 

He Pro Olu Lys Ala Phe Asn Thr Leu He He Gly Ser Hie Gly Leu Gly Ala Hie Ser 60 

GGA ATC TAC ATT AGT GTT GCT GAA GAA TTT GCT AGO CAC GGA TTT GGA TTC TGC ATO CAC 240 

Gly lie Tyr He Ser Val Ala Glu Glu Phe Ala Arg Hia Gly Phe Gly Phe Cya Met His 60 

OAT CAA AGO GGA CAT GGG AGA ACQ OCA AGC GAT AQA GAA AGA GGG TAT GTO GAG GGC TTT 300 

Asp Gin Arg Gly His Gly Arg Thr Ala Ser Asp Arg Glu Arg Gly Tyr Val Glu Gly Me 100 

CAC AAC TTC ATA GAG GAT ATG AAG GCC TTC TCC GAT TAT GCC AAG TOG CGC GTG GGA GOT 360 

His Aan Phe He Glu Asp Met Lys Ala Phe Ser Asp Tyr Ala Lys Trp Arg Val Gly Gly 120 

GAC GAA ATA ATA TTQ CTA GGA CAC AGT ATG GGC GGG CTO ATA GCG CTC TTA ACA GTT GCA 420 

Asp Glu Xle He Leu Leu Gly His Ser Met Gly Gly Leu He Ala Leu Leu Thr Val Ala 140 

ACT TAT AAA GAA ATC GCC AAO GGA GTT ATC GOG CTA GCC COG GCC CTC CAA ATC CCC TTA 400 

Thr Tyr Lys Glu Xle Ala Lys Gly Val Xle Ala Leu Ala Pro Ala Leu Gin Xle Pro Leu 160 

ACC CCO GCT AGA AGA CTT GTT CTA AGC CTC GCG TCA AGO CTT GCC COG CAT TCT AAO ATC 540 

Thr Pro Ala Arg Arg Leu Val Leu Ser Leu Ala Ser Arg Leu Ala Pro His Ser Lys Xle 180 

ACC TTA CAA AGO AGA TTO COG GAG AAA CCA GAG GGT TTT CAA AGA GCA AAA GAT ATA GAA £00 

Thr Leu Oln Arg Arg Leu Pro Oln Lys Pro Glu Gly Phe Gin Arg Ala Lys Asp Xle Glu 200 

TAC AOT CTO AGT GAA ATA TCA CTC AAC CTC OTO GAC GAA ATO ATT AAA GCA TCA TCT ATO 660 

Tyr Ser Leu Ser Glu He Ser Val Lys Leu Val Asp Glu Met Xle Lys Ala Ser Ser Hot 220 

TTC TOG ACC ATA GCA GGG GAA ATT AAT ACT CCC GTC CTG CTT ATT CAT GGG GAA AAA GAC T20 

Phe Trp Thr He Ala Gly Glu Xle Asn Thr Pro Val Leu Leu He His Gly Glu Lya Asp 240 

AAT GTC ATA CCT CCG GAG GCG ACC AAA AAA GCC TAC CAA TTA ATA OCT TCA TTC CCT AAA 780 

Aan val Xle Pro Pro Glu Ala Ser Lys Lys Ala Tyr Gin Leu Xle Pro Ser Phe Pro Lys 260 

GAG TTQ AAA ATA TAC CCC GAT CTT GGA CAC AAC TTC TTT TTT GAA CCA GGC GCG GTG AAA 040 

Glu Leu Lys Xle Tyr Pro Asp Leu Gly His Asn Leu Phe Phe Glu Pro Gly Ala Val Lys 280 

ATC GTC ACA GAC ATT CTA GAG TGC GTT AAG AAT CTA CCC AGO GAA AAT OCT TAA 694 

lie Val Thr Asp lie Val Glu Trp Val Lys Asn Leu Pro Arg Olu Asn Pro 298 
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FXGUftB 6 

Thomococcus CL*2~30LC 

ATO GAG GTT TAC AAG GCC AAA TTC GGC GAA OCA AAG CTC GOC TGG GTC GTT CTO GTT CAT €0 

Met Glu Val Tyr Lys Ma Lys Pbe Gly Glu Ala Lya Leu Gly Trp val Val Leu Val His 20 

GGC CTC GGC GAG CAC AGC GGA AGG TAT OGA AGA CTG ATT AAG GAA CTC AAC TAT GCC GGC 120 

Gly Leu Gly Glu His Ser Gly Arg Tyr Gly Arg Leu lie Lys Glu Leu Asn Tyr Ala Gly 40 

TTT GGA GTT TAC ACC TTC GAC TOG CCC GOC CAC GOG AAG AGC CCG GGC AAG AGA GGG CAC ISO 

Phe Gly Val Tyr Thr Phe Asp Trp Pro Gly Hie Gly Lye Ser Pro Gly Lys Arg Gly His 60 

AGO AGC GTC GAG GAG GCG ATO GAA ATC ATC GAC TCO AXA ATC GAG GAG ATC AGG GAG AAG 240 

Thr Ser Val Glu Glu Ala Met Glu lie lie Aep Sar lie lie Glu Glu lie Arg Glu Lye 00 

CCC TTC CTC TTC GGC CAC AGC CTC GOT GGT CIA ACT GTC ATC AGG TAC GCT GAG AOG CGG 3.00 

Pro Abe Leu Phe Gly Hi a Ser Leu Gly Gly Leu Thr Val He Arg Tyr Ala Glu Thr Arg 100 

CCC GAT AAA ATA CGG GGA TEA ATA OCT TCC TCG CCT GCC CTC GCC AAG AGC CCG GAA AOG 360 

Pro Asp Lys lie Arg Gly Leu lie Ala Ser Ser Pro Ala Leu Ala Lys Sex Pro Glu Thr 120 

CCG GGC TTC ATO GTG GCC CTC GCG AAG TTC CTT GGA AAG ATC GCC CCG GGA GTT GTT CTC 420 

Pro Gly Phe Met val Ala Leu Ala Lys Phe Leu Gly Lys Zle Ala Pro Gly Val Val Leu 140 

TCC AAC GGC ATA AAG CCG GAA CTC CTC TOG AGG AAC AGG GAC GCC GTG AGG AGG TAC GTT 400 

Ser Aan Gly He Lys Pro Glu Leu Leu Ser Arg Asn Arg Aep Ala Val Arg Arg Tyr Val 160 

GAA GAC CCA CTC GTC CAC GAC AGG ATT TCG GCC AAG CTG GGA AGG AGC ATC TTC GTG AAC 540 

Olu Asp Pro Leu Val His Asp Arg lie Ser Ala Lys Leu Gly Arg Ser lie Phe Val Asn lab 

ATO GAG CTG GCC CAC AGO GAG GCG GAC AAG ATA AAA GTC CCG ATC CTC CTT CTG ATC GGC 600 

Met Glu Leu Ala His Arg Glu Ala Asp Lys He Lys Val Pro He Leu Leu Leu He Gly 200 

ACT GGC GAT GTA ATA ACC CCG CCT GAA GGC TCA CGC AGA CTC TTC GAG GAG CTG GCC GTC 660 

Thr Gly Asp Val Zle Thr Pro Pro Glu Gly Ser Arg Arg Leu Phe Glu Glu Leu Ala Val 220 

GAG AAC AAA ACC CTG AGG GAG TTC GAG GGG GOO TAC CAC GAG ATA TTT GAA GAC CCC GAG 720 

Glu Asn Lys Thr Leu Arg Glu Phe Glu Gly Ala Tyr His Glu He Phe Glu Asp Pro Glu. 240 

TGG GCC GAG GAG TTC CAC GAA AGA ATT GTT AAG TGG CTG GTT GAA AAA TCG TAC TCT TOG 700. 

Trp Ala Glu Glu Phe His Glu Thr He Val Lys Trp Leu Val Glu Lya Ser Tyr Ser Ser 260; 

GCT CAA TAA 789 

Ala Gin 263 
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FIGURE 7 
Aguit&X VF5-34LC 

TTO ATT GGC AAT TTO AAA TTG AAG AGG TTT QAA OAS OTT AAC TTA QTT CTT TCO OGA GGG 60 

Leu lie Oly Asn Leu Lys Leu Lys Arg Phe Qlu Glu Val Asa Leu Val Leu Ser Oly oiy 20 

OCT GCC AAO OCT ATC GCC CAT ATA GOT OTT TTA AAA OCT CTG GAA GAG CTC GOT ATA AAG 120 

Ala Ala Lys Gly lie Ala His lie Gly val Leu Lys Ala Leu Olu Glu. Leu Oly lie Lys 40 

OTA AAG AGG CTC AGC GGG OTA AGT OCT GGA OCT ATC CTT TCC CTC TTT TAC GCT TOG GGC 180 

Val Lys Arg Leu Ser Gly Val Ser Ala Gly Ala He Val Ser val Phe Tyr Ala Ser Oly 60 

TAC ACT CCC GAC GAG ATG TTA AAA CTC CTO AAA GAG GTA AAC TOG CTC AAA CTT TTT AAG 240 

Tyr Thr Pro Asp Olu Met Leu Lys Leu Leu Lys Glu Val Asa Trp Leu Lys Leu Phe Lys 80 

TTC AAA ACA COG AAA ATG GGC TTA ATG GGG TGG GAG AAG GCT GCA GAG TTT TTO GAA AAA 300 

Phe Lys Thr Pro Lye Met Gly Leu Met Gly Trp Glu Lye Ala Ala Glu Phe Leu Glu x*ys 100 

GAG CTC GGA GTT AAO AGG CTG GAA GAC CTO AAC ATA CCA ACC TAT CTT TGC TCO GCG GAT 360 

Glu Leu Gly Val Lys Arg Leu Glu Asp Leu Asa He Pro Thr Tyr Leu Cys Ser Ala Asp 120 

CTO TAC ACQ GGA AAG GCT CTT TAC TTC GGC AGA GOT GAC TTA ATT CCC CTG CTT CTC GGA 420 

Leu Tyr Thr Oly Lys Ala Leu Tyr Phe Gly Arg Gly Asp Leu Xle Pro Val Leu Leu Gly 240 

AGT TOT TCC ATA CCC GGG ATT TTT GAA CCA OTT GAG TAC GAG AAT TTT CTA CTT GTT GAC 480 

Ser Cys Ser lie Pro Oly He Phe Glu Pro val Glu Tyr Glu Asn Phe ueu Leu Val Asp 160 

GGA GOT ATA CTG AAC AAC CTG CCC GTA GAA OCT TTG GAA AAO TTC AAA GAA CCC ATA ATC 540 

Gly Gly Xle Val Asn Asn Leu Pro val Glu Pro Leu Glu Lys Phe Lys Glu Pro Zle He 180 

GGG GTA GAT OTO CTT CCC ATA ACT CAA GAA ABA AAG ATT AAA AAT ATA CTC GAC ATC CTT 600 

Gly val Asp Val Leu Pro Zle Thr Gin Qlu Arg Lys He Lys Asn zle Leu His Zle Leu 200 

ATA AGG AGC TTC TTT CTG GCG GTT COT TCC AAT TOG GAA AAO AGA AAG OAO TTC TGC AAC 660 

He Arg Ser Phe Phe Leu Ala Val Arg ser Asn Ser Glu Lys Arg Lys Glu Phe Cys Asn 220 

GTA GTT ATA GAA CCT CCC CTT GAA GAG TTC TCT CCT CTG GAC OTA AAT AAO GCG GAC GAG 720 

Val Val He Glu Pro Pro Leu Glu Glu Phe Ser Pro Leu Asp Val Asn Lys Ala Asp Olu 240 

ATA TTC TOC GGG GAT ATG AGA GCA CTT TAA 750 

He Phe Cys Gly Asp Mot Arg Ala Leu 250 
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rcouRH s 

Terwlinibacter - 42L 


ATO CCA CCT AAT GAC TCA CCC AOS ATC GAC TTT AAT CCT CGC OOC ATT CTT CGC AAC OCT $0 

Met Pro Ala Aon Asp Ser Pro Thr I la Asp Phe Asa Pro Arg Gly He Leu Arg Am Ala 20 

CAC CCA CAG GTT ATT TTA GCO ACT TCC GGC TTG CGC AAA CCC? TTT TTO AAA CCC ACG CAC 120 

His Ala Gin Val Xle Leu Ala Tbr £ef Gly Leu Arg Lys Ala Phe Leu Lys Arg Thr Hia 40 

AAO AGC TAC CTC AGC ACT GCC CAA TOQ CTG GAG CTC OAT GCC GGC AAC OGA GTT ACC TTG 180 

Lya Ber Tyr Leu Ser Thr Ala Gin Trp Leu Glu Leu Asp Ala Gly Aan Gly Val Thr Leu 60 

GCC GGA GAG CTT AAC ACA GCG CCT OCA ACT GCA TCC TCC TCC CAC COO GCG CAC AAG AAC .240 

Ala Gly Glu Leu Aan Thr Ala Pro Ala Thr Ala Ser Ser Ser Hid Pro Ala Hid Lya Asn BO 

ACT CTQ GTT ATT GTG CTG CAC GGC TOG GAA GGC TCC AGC GAG TCG GCC TAT GCG ACC TCC 300 

Thr Leu val lie Val Leu Hla Gly Trp Glu Gly Ser Ser Gin Ser Ala Tyr Ala Thr Ser 100 

OCT GGC AGC ACG CTT TTC GAC AAT GGC TTC GAC ACT TTT CGC CTT AAT TTT CGC GAT CAC 3*0 

Ala Oly Ser Thr Leu Phe Asp Aan Gly Phe Aap Thr Phe Arg Leu Aan Phe Arg Asp His 120 

GGC GAC ACC TAC CAC TTA AAC CGC GGC ATA TTT AAC TCA TCG CTG ATT GAC GAA GTA CTO 420 

Gly Aap Thr Tyr His Leu Aan Arg Gly He Phe Aan Ser Ser Leu He Asp Glu Val Val 140 

GGC GCA GTC AAA GCC ATC CAG CAG CAA ACC GAC TAC GAC AAG TAT TGC CTG ATO GGO TTC 400 

Oly Ala val Lya Ala He Gin Gin Gin Thr Asp Tyr Aap Lya Tyr Cya Leu Met Oly Phe 160 

TCA CTO GOT GGO AAC TTT GCC TTO CGC GTC GCO GTG CGG GAA CAG CAT CTC GCT AAA CCG 540 

Ser Leu Gly Gly Asn Phe Ala Leu Arg val Ala Val Arg Glu Gin His Leu Ala Lya Pro 180 

CTA GOG GGC GTG CTC GCC GTA TGC CCG GTA CTC GAC CCC GCA CAC ACC ATG ATO GCC CXA 600 

Leu Ala Gly Val Leu Ala val Cya Pro Val Leu Aap Pro Ala His Tbr Met Met Ala Leu 200 

AAC CGA GOT GCG TTT TTC TAC GGC CSC TAT TTT GCO CAT AAA TGG AAG CGC TCG TTA ACC 660 

Aan Arg Gly Ala Phe Phe Tyr Oly Arg Tyr Phe Ala Hla Lys Trp Lya Arg Ser Leu Thr 220 

GCA AAA CTT GCA GCT TTC CCA GAC TAC AAA TAC GGC AAA GAT TTA AAA TCG ATA CAC ACG 720 

Ala Lye Leu Ala Ala Phe Pro Aap Tyr Lys Tyr Gly Lys Aap Leu Lya Ser He His Thr 240 

CTT GAT GAG TTA AAC AAC TAT TTC ATT CCC CGC TAC ACC GGC TTC AAC TCA GTC TCC GAA 780 

Leu Aap Glu Leu Aan Aan Tyr Phe He Pro Arg Tyr Thr Gly Phe Aan Ser Val Ser Glu 260 

TAC TTC AAA AGT TAC ACG CTC ACC GGG CAG AAG CTC GCG TTT CTC AAC TGC CCC AGT TAC 840 

Tyr Phe Lya Ser Tyr Thr Leu Thr Gly Gin X»ys Leu Ala Phe Leu Aan Cya Pro Ser Tyr 280 

ATT CTG GCA GCT GGC GAC GAC CCA ATA ATT CCA GCA TCC GAC TTT CAG AAA ATA GCC AAO 900 

He Leu Ala Ala Gly Aap Asp Pro He zie Pro Ala ser Aap Phe Gin Lys He Ala Lya 300: 

CCT GCO AAT CTG GAC ATA ACA GTA ACG CAA CAA GOT TCT CAT TGC GCA TAC CTG GAA AAC 960. 

Pro Ala Aon Leu Bis He Thr Val Thr Gin Gin Gly Ser Hla Cya Ala Tyr Leu Glu Aan 320 

CTG CAT AAA CCT AGT GCT GCC GAC AAA TAT GCG GFTG AAA TTA TTT.GGA GCC TOT TGA 1017 

Leu Bis Lya Pro Ser Ala Ala Aap Lys Tyr Ala Val Lys Leu Phe Gly Ala Cya Bnd 339 - 
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FXOURB 9 

Archeo&lobUB tulgidas VClff - 16HC1 

ATG CTT GAT ATG CCA ATC GAC CCT GTT TAC TAC CAG CTT GCT GAG TAT 
Met Leu Asp Met Pro lie Asp Pro Val Tyr Tyr Gin Leu Ala Glu Tyr 
1 S 10 15 

TTC GAC ACT CTG COG AAG, TTC GAC CAG TTT TCC TCG GCC AGA GAG TAC 
Pne Asp Ser Leu Pro Lys Phe Asp Gin Phe Scr Ser Ala Arg Glu Tyr 
20 25 30 

AGG GAG GCG ATA AAT CGA ATA TAC GAG GAG AGA AAC CGG CAG CTG AGC 
Arg Glu Ala lie Asn Arg lie Tyr Glu Glu Arg Asa Arg Gin Leu Ser 
35 40 45 

CAG CAT GAG AGG GTT GAA AGA GTT GAG GAC AGG ACS ATT AAG GGG AGG 
Gin His Glu Arg Val Glu Arg Val Glu Asp Arg Thr lie Lys Gly Arg 
50 55 60 

AAC GGA GAC ATC AGA GTC AGA GTT TAC CAG CAG AAG CCC GAT TCC CCG 
Asn Gly Asp lie Arg Val Arg Val Tyr Gin Gin Lye Pro Asp Ser Pro 

€5 70 75 SO 

GGT CTG GTT TAC TAT CAC GOT GOT GGA TTT GTG ATT TGC AGC ATC GAG 
Val Leu Val Tyr Tyr His Gly Gly Gly Phe Val lie Cys Ser lie Glu 
65 90 95 

TCG CAC GAC GCC TTA TGC AGG AGA AYY GCG AGA CTT TCA AAC TCT ACC 
Ser His Asp Ala Leu Cys Arg Arg He Ala Arg Leu Ser Asn Ser Thr 
100 1Q5 110 

GTA GTC TCC GTG GAT TAC AGG CTC OCT CCT GAG CAC AAG TTT CCC CCC 
Val Val Ser Val Asp Tyr Arg Leu Ala Pro Glu His Lys Phe Pro Ala 
115 120 125 

CGA GTT TAT CAT TGC TAC GAT GCG ACC AAG TGG GTT GCT GAG AAC CGG 
Ala Val Tyr Asp Cys Tyr Asp Ala Thr Lys Trp Val Ala Glu Asn Ala 
130 135 140 

GAG GAG CTG AGG ATT GAC CCG TCA AAA ATC TTC GTT GGG GGG GAC ACT 
Glu Glu Leu Arg He Asp Pro Ser Lys Xle Phe Val Gly Gly Asp Ser 
145 150 155 «0 

GCG GGA CGG AAT CTT GCC CCG GCG CTT TCA ATA ATG GCG AGA GAC AGC 
Ala Gly Gly Asn Leu Ala Ala Ala Val Ser Xle Met Ala Arg Asp Ser 
165 170 17S 

GGA GAA GAT TTC ATA AAG CAT CAA ATT CTA ACT TAC CCC GTT GTG AAC 
Gly Glu Asp Phe He Lys His Gin Xle Leu Xle Tyr Pro Val Val Asn 
180 X85 190 

TTT GTA GCC CCC AGA CCA TCG CTT CTG GAG TTT GGA GAG GGG CTG TGG 
Phe Val Ala Pro Thr Pro Ser Leu Leu Glu Phe Gly Glu Gly Leu Trp 
195 200 205 

ATT CTC GAC CAG AAG ATA ATG ACT TGG TTC TCG GAG CAG TAC TTC TCC 
Xle Leu Asp Gin Lys Xle Met Ser Trp Phe Ser Glu Gin Tyr Phe Ser 
210 215 230 

AGA GAG GAA GAT AAG TTC AAG CCC CTC GCC TCC GTA ATC TTT GCG GAC 
Arg Glu Glu Aso Lys Phe Asn Pro Leu Ala Ser Val Xle Phe Ala Asp 
23S 240 245 250 
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Clf GAG AAC CIA CCT CCT GCG CTG ATC ATA ACC GCC GAA TAC QAC COG 
Leu Glu Asa Leu Pro Pro Ala Leu lie He Thr Ala Glu Tyr Asp Pro 
255 260 265 

CIO AGA GAT GAA GGA GAA GTT TTC GGG CAG ATO CTO AGA AGA GCC GGT 
Leu Arg Asp Glu Gly Glu Val Phe Gly Gin Met Leu Arg Arg Ala Gly 
270 275 200 

GTT GAG GCG AGC ATC GIC AGA TAC AGA GGC GTG CTT CAC GGA TTC ATC 
Val Glu Ala Ser He Val Arg Tyr Arg Gly Val Leu His Gly Phe He 
265 290 255 

AAT TAC TAT CCC GTG CTG AAG GCT GOG AGG GAT GCG ATA AAC CAG ATT 
Asa Tyr Tyr Pro Val Leu Lye Ala Ala Arg Asp Ala He Asn Gin He 
300 305 310 

GCC GCT CTT .CTT GTG TTC GAC TAG 
Ala Ala Leu Leu Val Phe Asp 
315 320 
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FX08BS 10 
SulfolobuQ Saltat&rlcus PI - 8LC1 

ATG CCC CTA GAT CCT AGA ATT AAA AAG TTA CTA GAA TCA GCT CTT ACT 
Met Pro Leu Asp Pro Arg He Lys Lys Leu Leu Glu Ser Ala Leu Thr 
1 5 io 15 

ATA CGA ATT GGT AAA GCC CCA CTA GAA GAG GTA AGA AAG ATA TTT AGO 
lie Pro lie Gly Lys Ala Pro Val Glu Glu Val Arg Lys He Phe Arg 
20 25 30 

CAA TTA GCG TCG GCA GCT CCC AAA GTC GAA GTT GGA AAA GTA GAA GAT 
Gin Leu Ala Ser Ala Ala Pro Lys Val Glu Val Gly Lys Val Glu Asp 
35 40 45 

ATA AAA ATA CCA GGC AGT GAA ACC GIT ATA AAC GCT AGA GTG TAT TTT 
He Lya He Pro Gly Ser Glu Thr Val Zle Asn Ala Arg Val Tyr Phe 
50 55 60 

CCO AAG AGT AGC GGT CCT TAT GGT GTT CTA GTG TAT CTT CAT GGA GGC 
Pro Lys Ser Ser Gly Pro Tyr Gly Val Leu Val Tyr Leu His Gly Gly 
65 74 75 80 

GGT TTT GTA ATA GGC GAT GTG GAA TCT TAT GAC CCA TTA TGT AGA GCA 
Gly Phe Val He Gly Asp Val Glu Ser Tyr Asp Pro Leu Cys Arg Ala 
85 SO 95 

ATT ACA AAT GCG TGC AAT TGC GTT GTA GTA TCA GTG GAC TAT AGG TTA 
He Tnr Asa Ala Cys Asn Cys Val Val Val Ser Val Asp Tyr Arg Leu 
100 105 110 

GCT CCA GAA TAC AAG TTT CCT TCT GCA GTT ATC GAT TCA TTT GAC GCT 
Ala Pro Glu Tyr Lys Phe Pro Ser Ala Val He Asp Ser Phe Asp Ala 
115 X20 125 

ACT AAT TGG GTT TAT AAC AAT TTA GAT AAA TTT GAT GGA AAG ATG GGA 
Thr Aan Trp Val Tyr Asn Asn Leu Asp Lys Phe Asp Gly Lys Met Gly 
130 135 140 

GTT GCG ATT GCG GGA GAT AGT GCT GGA GGA AAT TTG GCA GOG GTT GTA 
Val Ala He Ala Gly Asp Ser Ale Gly Gly Asn Leu Ala Ala Val Val 
145 150 155 160 

GCT CTT CTT TCA AAG GGT AAA ATT AAT TTG AAG TAT CAA ATA CTG GTT 
Ala Leu Leu Ser Lys Gly Lys He Asn Leu Lys Tyr Gin He Leu Val 
165 170 175 

TAC CCA GCG GTA AGT TTA GAT AAC GTT TCA AGA TCC ATG ATA GAG TAC 
Tyr Pro Ala Val Ser Leu Asp Asn Val Ser Arg Ser Met He Glu Tyr 
180 185 190 

TCT GAT GGG TTC TTC CTT ACC AGA GAG CAT ATA GAG TGG TTC GGT TCT 
Ser Asp Gly Phe Phe Leu Thr Arg Glu His He Glu Trp Phe Gly Ser 
195 200 205 

CAA TAC TTA CGA AGC CCT GCA GAT TTG CTA GAC TTT AGG TTC TCT CCA 
Gin Tyr Leu Arg Ser Pro Ala Asp Leu Leu Asp Phe Arg Phe Ser Pro 
210 215 220 
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ATT CTG GOG CAA GAT TTC AAC GGA TTA CCT CCA GCC TTG ATA ATA ACA 
lie Leu Ala Gin Asp Phe Asn Gly Leu Pro Pro Ala Leu He He Tbr 
225 230 235 240 

GCA GAA TAG GAT CCA CTA AGG GAT CAA GGA GAA GCG TAT GCA AAT AAA 
Ala Glu Tyr Asp Pro Leu Arg Asp Gin Gly Glu Ala Tyr Ala Asn Lys 
245 _ 250 255 

CTA CTA CAA GCT GGA GTC TCA GTT ACT AGT GTG AGA TTT AAC AAC GTT 
Leu Leu Gin Ala Gly Val Ser Val Thr Ser Val Arg Phe Asn Asn Val 
260 26S 270 

ATA CAC GGA TTC CTC TCA TTC TTT CCG TTG ATG GAG CAA GGA AGA GAT 
He His Gly Pbe Leu Ser Phe Phe Pro Leu Met Glu Gin Gly Arg Asp 
275 260 285 

GCT ATA GGT CTG ATA GGG TCT GTG TTA AGA CGA GTA TTT TAT GAT AAA 
Ala He Gly Leu He Gly Ser Val Leu Arg Arg Val Phe Tyr Asp Lys 
290 295 300 

ATT TAA 

He 

305 
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figure XI 
UUX.l Bsteraoe ea33 

ATS AAG OTT AAA CAC GTT ATT GTT TTA CAT GGC TEA TAT AT6 TCT GGC TTG GTO ATG COC €0 

Mat Lye Val Lye Hia Val lie Val Leu Hia Gly Leu Tyr Net Ser Gly Leu Val Met Arg 20 

CGG TTA TCT TOO CGT CTA OAA GAO TCG GGG OTT AAA GTT TTA AAC TTA ACC TAC AAT ACT : 120 

Pro Leu Cys Ser Arg Leu Glu Glu Ser Gly val Lys val Leu Aan Leu Thr Tyr Asn Thr 40 

CQA GAC CCT AAT CGA GAT GCT ATT TTT ACQ CAA ATA GAT GAO TTT ATT AGC AAT GAG CCT 180 

Arg Asp Pro Asa Arg Asp Ala He Phe Thr Gin He Asp Glu Phe lie Ser Aon Glu Pro ' 60 

TCT GCT TTA OTG TOT CAC TCT ATG GGG GGC TTA GTT GCT COC GCC TAT TTA GAG GCA AAC 240 

Ser Ala Leu Val Cys Hia Ser Met Gly Gly Leu Val Ala Arg Ala Tyr Leu Glu Ala Aan 80 

TCA GCG CCA AGT CAT CAT CTT GAA AAG GTA ATC ACC TTA GGA ACQ CCA CAT ACT GGC AGC 300 

Ser Ala Pro Ser Bis Bis Val Glu Lys Val He Thr Leu Gly Thr Pro Hia Thr Gly Ser 100 

CAT ATT GCT GAA AAA AIG CAfi CAA AAA GGC TTC GAG CTA TTA TTA AAA AAT AGC GTT GAG 360 

Hia He Ala Glu Lyo Met Gin Gin Lys Gly Phe Glu Leu Leu Leu Lys Asn Ser Val Glu 120 

TTT TTA CTC TCT AAG AAT GOT GAT TOO CCT TTT AAA GCC AAG CTA TAT AGC ATT GCC GCC 420 

Phe Leu Leu Ser Lyo Aen Oly Asp Trp Pro Phe Lye Ala Lys Leu Tyr Ser He Ala Oly 140 

GAC TTA CCO ATT GGC TTA ATG CCA CTC ATT GTA AAA GGC AGC COC TCT GAT GGC ACT GTA 480 

Asp Leu Pro He Gly Leu Met Pro Leu He Val Lys Gly Ser Arg Ser Asp Gly Thr val 160 

TTO CTA GAT GAA ACC AAG CTA AAG GOT ATG GCT GAA CAC AAG GIG TTT CAT TTA AGC CAT 540 

Leu Lsu Asp Glu Thr Lys Leu Lys Oly Met Ala Glu Hie Lys Val Phe Bis Leu Sex Hie XBO 

AGA AGT ATG ATT TAC TCT COC CAA GTC GTT AAT TAT ATT CTT GAG CGC TTG AAC GAG GAC 500 

Thr Ser Met He Tyr Ser Arg Gin Val Val Asn Tyr He Leu Glu Arg Leu Asn Glu Asp 200 

ATT TA SOS 

He 201 
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Figure 12 

Whale Mat Sasple 11*801 Esterase eeS 

AT3 ATA AAA AAC TTC GAC AGA GAA AAT TCT AOC TTA OTA CTO TCC OCT GGT GOT OCT CPS 60 

Met He Lys Aon Phe Asp Arg Glu Asn Ser ser Leu Val Leu ser Gly Oly Gly Ala Leu 20 

GGT ATT OCT CAC TTO GOT OTA CTO CAT QAC CTT QAA AAA CAA AAT ATT OTA CCA AAT GAA 120 

Oly lie Ala Hie Leu Oly Val Leu Hie Asp Leu Olu Lys Gin Aan He Val Pro Asn Glu * 40 

ATT CTT GGT ACA ACT ATO GOT GOT ATC ATT GGT GCA TCT ATO OCT ATC OGG ATG AAA GAG 180 

He Val Gly Thr Ser Met Gly Oly He He Oly Ala Ser Het Ala He Oly Met Lys Olu 60 

AAA GAA ATA CTC GAA QAA ATC AAA AAC TTT TCC AAT OTC TTC AAC TGO ATA AAA TTC TCT 240 

Lys Olu He Leu Olu Olu He Lya Aan Phe Ser Asn Val Phe Am Trp lie Lys Pbe Ser 00 

TTT TCC GOT AAT TCT 0TT OTC W AAC OAO AAA ATC OCT AAG ASA TTT GAT ACT CTT TTT 300 

Phe Ser Oly Aan Ser Val Val Asp Aan Olu Lye Ue Ala Lys He Phe Aep Thr Leu Phe 100 

AAA QAC AOA AAG ATO ACA GAT ACQ GTG ATC OCT CTT AAA CTC ATC GCT ACA AAC TTA CAT 360 

Lya Aep Arg Lys Met Thr Aep Thr Val Ue Pro Leu Lye Leu He Ale Thr Asn Leu His 120 

AAT GGA CAT AAA AAA OTA TTT ACT OCT TOO OAT OAT OTA CIO ATC AAA OAT GCA ATA CTC 420 

Asn Gly His Lye Lys Val Phe Thr Ala Ser Aep Asp Val Leu He Lys Aep Ala He Leu 140 

TCA ACA ATO CCA ATA CCC GOT GTA TTT GAA GAA CAT ATT ATT OAT GOT GAA ACC TAT GGC 480 

Ser Thr Met Ala He Pro Gly val Phe Glu Olu His He He Asp Gly Glu Thr Tyr oly X60 

GAC GOT TTT CTT TOT GAA AAC CTT GOT CTO AAT GAO OCA ACA TTC AAT OAT GTT TTA GCT 540 

Asp Gly Phe Leu Cys Glu Aan Leu Gly Val Asn Olu Ala Thr Phe Asn Asp Val Leu Ala 180 

GTA OAT OTC ATO GOT GAG AAC TCT TTT GAA AAA GCA ATO CCQ GAC AAC TTC TTT AAA ACA 600 

Val Aep Val Met Gly Glu Aon Ser Phe Glu Lye Ala Met Pro Asp Asn Phe Phe Lys Thr 200 

TCA AAT GTT TTA GAA ATG TTT GAA AAA TCA ATG CGA CTT TTT ATT TAC AAC CAG ACA CAG 660 

ser Asn Val Leu Glu Met Phe Glu Lys Ser Met Arg Leu Phe He Tyr Asn Gin Thr Gin 220 

ACA CAT ATT AAA AAT GCA AAT AAA AAT ATT TAT CTT ATT GAA CCC GTT AOC AAA GAO TAT 720 

Thr His He Lys Asn Ala Asn Lys Asn He Tyr Leu He Olu Pro Val Thr Lys Glu Tyr 240 

AAA ACA TTT CAA TTT CAT AAA CAT AAA GAG ATA COT GCT TTA GGC TTO GGT TTA CTO TO 779 

Lye Thr Phe Gin Phe His Lys Hie Lys Glu He Arg Ala leu Gly Leu oly Leu Leu 259 
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Figure 23 

Hetallesphaera Prunes Pen 12/2 Esterase 23mci 

ATO CCC CIA CAT OCA AAG GTA AAG AAA TTA CTT TCC CAO CTA CCT CCC CAO GAC TTC TCC 60 

Met Pro Leu Rio Pro Lys Val Lys Lys Leu Leu Sex Gin Leu Pro Pro Gin Asp Phe ser 20 

ASA AAC GTO CAO OAC CTO ACQ AAG OOC TOO GAT TTA CCC TTC TCA GGG AGG AGO GAG ACC 120 

Arg Asn Val Gin Asp Leu Arg Lys Ala Trp Asp Leu Pro Phe Ser Gly Arg Arg alu Thr 40 

CTG AAG AGO GTT GAG OAC CTT GAG ATA CCC ACT AGO GAC GCA CBA ATC AGG GCC AGO GTC 180 

Leu Lys Arg Val Glu Asp Leu Glu lie Pro Thr Arg Asp Ala Arg lie Arg Ale Arg Val 60 

TAC ACC CCC TCA AGT AAG QAA AAC TTA CCC GTC CTT GTT TAC TAT CAC GGC GOT GGC TTC 240 

Tyr Thr Pro Ser Ser Lys Glu Asn Leu Pro val Leu Val Tyr Tyr His Gly Gly Gly Pbe • so 

GTG TTC GOT ACC GTT GAC AGC TAC GAC GGC CTC GCA TCC CTT ATT GCC AAC GAA TCP GGG 300 

Val Phe Gly Ser Val Asp Ser Tyr Asp Gly Leu Ala Ser Leu He Ala Lys Glu Ser Gly 100 

ATT GCO GTT ATC TCC GTG GAG TAT AGG CTC GCC CCT GAG CAC AAG TTC CCC ACC GCA GTC 360 

lie Ala Val lie Ser Val Olu Tyr Arg Leu Ala Pro Glu His Lys Phe Pro Thr Ala Val 120 

AAC GAC TOO TOG GAT GOT CTT CTC TGG ATC GOG GAG AAC GGA GGC AAG CTO GGG CTC GAC 420 

Asa Asp Ser Trp Asp Ala Leu Leu Trp He Ala Glu Asn Gly Gly Lys Leu Gly Leu Asp 140 

ACC TOO AQA CTT GCC GTG CCT GGG GAT AGT OCT GGA GGA AAC CTO TCT GCC GTO GTG TCC. 480 

Thr Ser Arg Leu Ala Val Ala Gly Asp Ser Ala Gly Gly Asn Leu Ser Ala Val Val Ser 16O 

CTC CTO GAC AGG GAC CAO GGT AAG GGA CTG GTT AGT TAT CAO GTC CTA ATC TAC CCA GCA. &40 

Leu Leu Asp Arg Asp Gin Gly Lys Gly Leu Val Ser Tyr Gin Val Leu Zle Tyr Pro Ala ISO 

GTG AAC ATO GTC GAT AAC TCC CCA TCC GTC AGO GAG TAC GGC GAO GGA TAC TTC CTC ACC 600 

val Asn Het Val Asp Asn Ser Pro Ser val Arg Glu Tyr Gly Glu Gly Tyr Phe Leu Thr 200 

AGO TCC ATG ATO AAC TOO TTC GGG ACC ATO TAC TTC TCC TCT GOA AGO GAA GOO OTA TCC 660 

Arg Ser Met Met Aan Trp Phe Gly Thr Met Tyr Phe Ser Ser Gly Arg Glu Ala Vol Ser 220 

CCC TAC GCC TCT CCA GCC TTG OCT GAC CTA CAT AAC CTC CCA CCC TCA CTG GTO ATC ACT T20 

Pro Tyr Ala Ser Pro Ala Leu Ala Asp Leu Rls Asn Leu Pro Pro Ser Leu Val Zle Thr 240 

GCA GAG TAT GAT CCC CTA AGG GAT CAO GGA GAO ACC TAC TCT CAC TCC CTA AAC GAG GCT 780 

Ala Glu Tyr Aep Pro Leu Arg Asp Gin Gly Glu Thr Tyr 9er His Ser Leu Asn Glu Ala 260 

GGA AAC GTA TCA ACC TTG GTT AGA TAT GAA GGA ATG ATT CAC GGC TTC CTO TCC TTC TAC 840 

Gly Asn Val Ser Thr Leu Val Arg Tyr Gin Gly Met Zle His Oly Phe Leu Ser Phe Tyr 280 

GAG TOG ATA ACT GCC GGT AAA CTA GCC ATT CAC CAC ATT OCT GOG GTT CTG AGA TCT GTC 900 

Glu Trp He Thr Ala Gly Lys Leu Ala Zle His His Zle Ala Gly Val Leu Arg Sex Val 300 

CTT TA 90S 

Leu 301 


150 


EP 0 880 590 B1 


Figure 14 

Thermotoga neepolita&a S068 Bo teraae 5<se4 

OTO GCC TTC TTC GAT ATQ CCC CTT GAG GAA CTO MA AAG TAC OCO CCT GAA AGG TAC GAG 60 

Val Ala Phe Phe Asp Met pro Leu Glu GXu Leu Lys Lys Tyr Arg pro Glu Arg Tyr Glu 20 

GAG AAA GAT TTC GAT GAG TTC TOO AGG GAA AOfc CTT AAA GAA AGC GAA GGA TTC CCT CTG . X20 

Glu Lys Asp Ate Asp Glu Pha Txp Arg Glu Ttar Leu Lye Glu Ser Glu Gly Plm Pro Leu 1. 40 

GAT CCC GTC TTT GAA AAO GTG GAC TTT CAT CTC AAA AC3 GTT GAA ACO TAC GAT GTT ACT ISO 

Asp Pro val Phe Glu Lys Val Aep phe His Leu Lys Thr Val Glu Thr Tyr Asp Val Thr . so 

TTC TCT GGA TAC AGG GGG OU3 AGA ATA AAG GGC TGG CTT CTT GTT COO AAG TTG GOG GAA • 240 

Phe Ser Gly Tyr Arg Gly Gin Arg He Lys Gly Trp Leu Leu Val Pro Lye Leu Ala Glu ao 

GAA AAO CTT CCA TGC GTC GTO CAG TAC ATA GOT TAC AAT GOT GGA AGO GOT TTT CCA CAC 300 

Glu l*ye Leu Pro Cys val val Oln Tyr lie Gly Tyr Asa Gly Gly Arg Gly Phe Pro Bis 100 

GAC TGG CTG TTC TGG COG TCA ATG GOT TAC ATC TOT TTT GTC ATQ GAC ACC AGG GGG CAG 360 

Asp Trp Leu Phe Trp Pro Ser Met Gly Tyr lie Cys Phe Val Met Asp Thr Arg Gly Gin 120 

GGA AGC GGC TGG ATG AAG GGA GAC AGA COG OAT TAC OCT GAG GOT COl GTC GAT CCA CAO 420 

Gly Ser Gly Trp Met Lys Gly Aep Thr Pro Asp Tyr Pro Glu Gly Pro Val Asp Pro Gin 140 

TAC CCC GGA TTC ATG ACG AGG GGC ATT CTG GAT CCO GGA ACC TAT TAC TAC AGG CGA GTC 480 

Tyr Pro Gly Phe Met Thr Arg Gly Xle Leu Asp Pro Gly Thr Tyr Tyr Tyr Arg Arg Val 160 

TTC GTG GAT GOG GTC AGG GOG GTG GAA GCA GCC ATT TCC TTC GOG AGA GTG GAT TCC AGG 540 

Phe Val Aep Ala val Arg Ala Val Glu Ala Ala Xle Ser Phe Pro Arg val Asp Ser Arg 180 

AAG GTG GTG GTG GCC GGA GGC AGT CAG GOT GGG GQA ATC CCC CTT GCG GTG ACT GCC CIO 600 

Lys Val Val val Ala Gly Gly Ser Gin Gly Gly Gly He Pro Leu Ala Val Ser Ala Leu 200 

TCG AAC AGG OTO AAG GCT CTO CTC TGC GAT GTG COG TTT CTG TGC CAC TTC AGA AGO GCC £60 

Ser Asn Arg Val Lys Ala Leu Leu Cys Asp Val Pro Phe Leu Cys His Phe Arg Arg Ala 220 

GTG GAA CTT GTC GAC AGA CAC CCA TAC GTG GAG ATC ACC AAC TTC CTC AAA ACC CAC AGG 720 

Val Gin Leu Val Asp Thr His Pro Tyr val Glu Xle Thr Asn Phe Leu Lys Thr His Arg 240 

GAC AAA GAG GAG ATT GTT TTC AGA ACA CTT TCC TAC TTC GAT GCT GTG AAC TTT GCA GCA 780 

Asp Lys Glu Glu Xle Val Phe Arg Thr Leu Ser Tyr Phe Asp Gly val Asn Phe Ala Ala 260 

AGG GCA AAG GTO CCC GCC CTG TTT TCC GTT GGG CTC ATG GAC ACC ATC TGT CCT CCC TOG 840 

Arg Ala Lys Val Pro Ala Leu Phe Ser Val Gly Leu Met Asp Thr Xle Cys Pro Pro Ser 280 

ACG GTC TTC GCC GCT TAC AAC CAC TAC GCC GOT CCA AAG GAG ATC AGA ATC TAT COG TAC 900 

Thr Val Phe Ala Ala Tyr Asn His Tyr Ala Gly Pro Lys Glu Xle Arg Xle Tyr Pro Tyr 300 

AAC AAC CAC GAA GCT GGA GGT TCT TTC CAG GOV ATT GAG CAG GTO AAA TTC TTG AAO AGA 960 

Asn Asn Bis Glu Gly Gly Gly Ser phe Gin Ala Xle Glu Gin Val Lys Phe Leu Lys Arg 320 

CTA TTT GAG GAA GGC TAG 978 

Leu Phe Glu Glu Gly 326 
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Figure IS 

JSalletaegiua liehealcola Esterase 77&ei 

ATO CGC ACC CTC TCC TTC OQT COO ATO ACC ACA QGQ GOA AGC ATT CAC ATQ GCO ACC WO 60 
Met Arg Thr Leu Ser pita Gly Pro Met Thr Thr 01/ Oly Ser lie nia Mat Ala Thr Met 20 

QAC CTO ATO CGC GOO COO 000 ATO CAO OGO CTO TCA CAO QOC GCC AGO QAO GCC GOO AAC 12o 
Affp Val Hat Arg Gly Pro Oly Wat Oln Arg Leu Ser Ola Oly Ala Ax? Glu Ala Ala Asn 40 

CAC CCC TOO GCO AAO COA CTO OGC COC ATO GOC TAC OCO GCC AAO GOC GCC GTO TAC GCC 180 
Hia Pro Trp Ala Lya Arg Leu Oly Arg Mat Oly Tyr Ala Ala Lye Oly Ala Val Tyr Ala 60 

ATC ATC QOC GTO CTC OCO CTG AAO CTC GOO GCO OGC QAO GOC GOC COO ACC AGO GAC ACC 240 
lie lie Oly Vol lieu Ala Leu Lya Leu Ala Ala Gly Glu Oly Gly Arg tar Thr Asp Ser 80 

CAC OGC GCG GTO AAC ACC GIG GOG CAC GGO CCC TTC GOC GTC GCO CTO CTO GCO GTO CTO 300 
Hia Gly Ala Val Aen Thr Val Ala Hia Gly Pro Phe Gly Val Ala Leu Leu Ala val Leu 100 

GTO GIG OOC CTO CTG CGC TAC GTO GTC TOO AGO TTC GCC CAO GCC TTC GTO GAC ACQ GAG 360 
val val 31y Leu Leu Oly Tyr Val Val Trp Arg Phe Ala Gin Ala Phe Val Asp Tax Olu UO 

QAC AAO OGC TCC GAC GCG AAG GGA ATC GCC AGO COC GCC ATO TAC TTC CTC AOC GCC TGC 420 
Asp Lye Gly Ser Asp Ala Lya Gly lie Ala Thr Arg Ala Net Tyr Pbe Leu Ser Oly Cys 140 

* ATC TAC GOG TOO CTG GCC TTC TTC GOC GOO CAO TCC CTO GTO GGC GCC GOO CAC OOC COG 480 
He Tyr Ala Ser Leu Ala Phe Phe Ala Ala Gla Ser Leu Val Gly Ala Ala Bis Oly Arg ' 160 

AOC AAG GOO AGO CAO GOC TGO ACQ GCC ACQ CTG ATO GAG CAO CCC TTT GGC CGC GTO CTG 540 
ser Lya Gly Thr Ola Gly Trp Thr Ala Thr Leu Wet Glu Gin Pro Phe Gly Arg Val Leu ISO 

G1Q GOO CTO GTO GOG CTO GGC ATC GTO GGC TTC GCG CTO AAO CAO TTC CAC ACC GCO TOO 600 
Val Ala Leu Val Gly Leu Oly lie Val Gly Phe Ala Leu Lya Gin Phe Hia Thr Ala Trp 200 

AAO GOO AAG TTC 0G0 GAG AAG CTC ACC CTC ACC GOA CTG OCT GCC CGO AAO CAO CAC CAC £t>0 
Lya Ala Lys Phe Arg Glu Lya Leu Thr Leu Thr Gly Leu Ala Ala Arg Lya Gin Boa Bis 220 

ATC GAG CGC ATO TGC CAO TTC GOC ATC OCC GCG CGC GOC GTO GTO TTC GCC OTC ATC GOC 720 
lie Olu Arg Het Cya Oln Phe Gly Zle Ala Ala Arg Oly Val Val Phe Ala Val lie Gly 240 

GOC TTC CTC GTC CGC TCC GCC GTO GAC GOG AAC OCC GOC GAG GCC AAO GGC CTO GOA GAG "TOO 
Gly Phs Leu Val Arg Sex Ala Val Asp Ala Aan Pro Oly Olu Ala Lya Oly Leu Gly Glu 260 

GOC CTG OCC GTC OTC GOG AGO CAC COO TCC GGC GAC GTO CTC CTG GGO GTG GTO GOO GCO 840 
Ala Leu Ala val Val Ala Arg Gla Pro Ser Gly Asp Val Leu Leu oly Val Val Ala Ala 280 

GCC CTG GTO GCC TAC GCC GCC TAC CTG TTC CTC CAO GCO COC TAC CGC GAA CTC TAG - 897 

Gly Leu val Ala Tyr Ala Ala Tyr Leu Phe Leu Oln Ala Arg Tyr Arg Glu Leu End- - 299 
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Figure 16 

Whale Hat staple XI. 901 Bo tor as* ea3 

ATQ ACC AAA. TTC CCA ATA CTC TOO GCG TtG ATA ACQ OCA TAC CTG CCO OAA CCT GTG AXO GO 
Met Ser Lya phe Ala He Leu Trp Ala Leu Xle Thr Ala Tyr Leu Pro Glu Pro Val fee 20 

AAA CTC OTA TAT TTA GOO COO CCC GAA ACQ CTT OGO OCA COG AGO CXT GAC OTT AAA OCC 12 0 
Lya Leu Val Tyr Leu Oly Arg Ax? Olu Thr Leu OXy Ala Arg Tbr Leu Asp Val Lya Ala 40 

CAA OCT OTC 090 OGO CTO CCC AAT OCA ACA AGA CCT GTC OGO OTO ATT CCO ACQ CTC GA0 180 

OXd Ala Val Oly Arg Leu Ala Asa Ala Thr Arg Pro Val Oly Val Xle Pro Thr Val Olu 60 

OAA A0C COO AAO ATO AO0 GAT AAA CCC GTT AOC CTT TTT OAT CAO CCC OCC CCC OAA TTA . 240 

Olu 9ar Arg Lye Met Thr Asp Lya Ala Val Sex Leu Phe Asp Oln Pro Ala Pro Glu Leu 80 

TTC COT AAA AAA CAC ATT CAO ATT GAC GOO CCT GAA 000 CCT ATT GAT OCC COT ATT TAC 300 

Phe Arg Lya Lya Asp lie Oln He .Asp Oly Ala. Olu Oly Pro Xle Asp Ala Arg Xle Tyr- 100 

AOC OGC CCT GCA AAA GAT COC CCN CON CCA ATN CtA OTO TAT TTT CAC OOC GOT OOC TOO 360 

Ser Oly Pro Ala Lya Bis Arg Pro Arg Pro Xle Leu Val Tyr Phe Bis Oly Oly Gly Trp 120 

GTT CAO CGC AAT CTO GAC AOC CAT GAC OGO GTT TOC GOC AAO CTG OCA AAA TOO GCQ AAC 420 

Val Gin Oly Asn Leu Asp Ser His Asp Gly Val Cys Oly Lye Leu Ala Lya Trp Ala Asn no 

TOC ATT OTT ATC TCO OTC GAT TAT COT CTA GCQ CCC GAA CAC AAA TTT CCT TOT GOT CCC 480 

Cys lie Val XXe Ser Val Asp Tyr Arg Leu Ala Pro Olu His Lys phe Pro Cys Ala Pro 160 

CTT OAT CCO ATT OCO GCC TAT AAA TCO GTG CGC GCC AAC OCA ACA AAC CTT GGC GGC GAT 540 

Leu Asp Ala Xle Ala Ala Tyr Lys Trp Val Arg Ala Asn Ala Thr Asn Leu Gly Gly Asp 180 

CCT GAA COT ATC GGC GTT GGC GOC GAT ACC GCA GOG GOC AAT CXT OCC GCC OTT OTC TOC 600 

pro Olu Arg Xle Oly val Oly Oly Asp Ser Ala Oly Gly Asn Leu Ala Ala val Val cys 200 

CAA CAA ACC OCC ATO AAC OGC GAO CGC ACA CCA GAT CTO CAA GTC CTG ATC TAT CCO CCO 660 

Oln Ola Thr Ala Met Asn Oly Glu Arg Thr Pro Asp Leu Gin Val Leu Xle Tyr Pro Ala 220 

CTO GAT GCA CGC ATO ATC TCO ACC TCO ATO GAG GAA TTO COT GAT GCC TAC ATC TTO CCO 720 

Leu Asp Ala Arg Met Xle Ser Thx Ser Met Olu Glu Leu Arg Asp Ala Tyr Xle Leu Pro 240 

AAA TCC AGA ATC GAO TAT TTC CTC GGC CTA TAT ACQ COT GGC CCT GAC GAT ATC GAG GAC 7B0» 

Lys Ser Arg Met Glu Tyr Phe Leu Gly Leu Tyr Thr Arg Gly Pro Asp Asp Xle Olu Asp 260 

CTT AGG ATO TCG CCA ATT CTC AGO GAT ACC OTC OCO GAT CAA CCC CAA GCC TCC ATT GTC 840 

Leu Arg Met Ser Pro lie Leu Arg Asp Thr Val Ala Asp Gin Pro Oln Ala eye Xle Val 260 

ACC TOT GOO TTT GAC CCT GOO CGA CGA COO GAA CAC CTA CGC CQA ACQ CTT AAT TCC COA 900 

Thr cys Oly Pha Asp pro Ala Arg Arg Arg Olu His Leu Arg Arg Thr Leu Asn Cys Arg 300 

GOO GAT AGA COT TA 914 

Gly Asp Arg Arg 304 
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Figure l? 

Whale Mat Staple AD3059 Esterase es4 

GTG AOC ATT OCT CTO GQA CTO TTA AAC TOO TIT TTG AAT ACC TTT GAA AAA CCA AAA CTG 60 

val Ser lie Arg Leu Arg Leu Leu Asn Trp Phe Leu Asn Thr Phe Olu Lys Pro Lye Leu 20 

OCC GCO GCC AAA ACO COO GAT GAT TTG CCA AAA TOG TTT GAA TTA AAO GCO AGO TTT TTG 120 

Ala Ala Ala Lys Thr Pro Asp Asp Leu Ax? Lys Ser Phe 61u Leu Lys Ala Arg Phe Leu 40 

TTT CCD GCG CCA COT AAA ACA AGO TTT ACT CAT GAT OTA TTG CAO TGA GCC ATC GGG TCO 180 

Phe Pro Ala Pro Arg Lye Thr Arg Phe Ser His Aop Val Leu Gin Ser Gly lie Oly Ser 6o 

GTA AAT GCC CAO TOG GOO AAA TCC AAA TCT CCA TCT GAT OAC AGO GTA ATC CTO TAT TTT 240 

Val Asn Ala Gin Trp Ala Lys ser Lys Ser Ala Ser Asp Asp Arg Val He Leu Tyr Phe 80 

CAT GGG GGA GOG TAT GTT TTT GOO TCA CCA AAA AOS GAC COT OCA ATQ TTG OCG CGC TTG 300 

Hia Oly Gly Gly Tyr Val Phe Gly Ser Pro Lys Thr His Arg Ala Met Leu Ala Arg Leu 100 

TCO OCA ATO ACA GOT CTT TCT GCO TGC CTT CCA GAT TAT AGO TTG GCA CCA QAG CAC CCA 360 

Ser Ala Met Thr Gly Leu Ser Ala Cys Leu Pro Asp Tyr Arg Leu Ala Pro Olu His Pro 120 

TTT OCA GCC GCO ATC GAA GAT GCA GTT TTA TOO TXT AAA TOT TTA CTA GAG CGA GCA ATC 420 

Phe Pro Ala Ala He Olu Asp Ala Val Leu Ser Tyr Lys Cys Leu Leu Olu Arg Ala He 140 

GAG CCC CAA AAT ATT ATA CTG GGG GGG GAC ACT OCT OCT GGC GGT TTG GTT CTT CCT TTG 480 

Glu Pro Gin Asn lie He Leu Gly Gly Asp Ser Ala Gly Gly Gly Leu val Leu Ala Leu 160 

CTT GCA GAA ATC AAG GCC CAA TCC TTG CCC AAA CCT GOT CGC GTT TTT GCC TTG TOO CCT 540 

Leu Ala Glu He Lys Ala Gin Ser Leu pro Lys Pro Ala Gly Val Phe Ala Leu Ser Pro 160 

TTG OTT GAT TTA TCA TTT TCO GGC CTT TCO TTT TCT AAA AAT GCC CAA ACC GAT OTO ATQ 600 

Leu Val Asp Leu Ser Phe Ser Gly Leu Ser Phe Ser Lye Asn Ala Gin Thr Asp Val Met 200 

TTG CCC GCA TCA CGG CCT GCO GAT ATO GCO ACC TIG TAT TTG GAT GOO GCC GAT GCA GAT 660 

Leu Pro Ala Ser Arg Ala Ala Asp Met Ala Thr Leu Tyr Leu Asp Gly Ala Asp Ala Asp 220 

GAT CCA CGT OCA TOO COO CTG CAO GOO OAT TTT TCT GGC ATO COO CCT GTA TTT CTO ACA 720 

Asp Pro Arg Ala Ser Pro Leu Gin Ala Asp Phe Ser Gly Met Pro Pro Val Phe Leu Thr 2^0 

GCA AGT GAC AGT GAA ATC CTG TTG GAT GAT TGC CTO CGG ATO GOO GAT CAC TTG CGT GCO 780 

Ala Ser Asp Ser Glu He Leu Leu Asp Asp Cys Leu Arg Met Ala Asp Bis Leu Arg Ala 260 

CAA GGT GTC GTT GTG ACA GAC COO ATT GTT GAA AAC CAT CCA CAT GTT TOG CAT ATT TTT 840 

Gin Gly Val Val Val Thr Asp Arg He Val Glu Asn His Pro Bis Val Trp Bis He Phe 280 

CAA CGC CTT CTA CCC GAA GCA GAT CAO GGG CTO CGG GOO ATT GCC GCO TOG ATT AAA CCT 900 

Gin Arg Leu Leu Pro Glu Ala Asp Gin Gly Leu Arg Ala He Ala Ala Trp He Lys Pro 300 

CTT TTA TCA GGT TCA AAC GAA AGC TA 926 

Leu Leu Ser Gly Ser Asn Olu Ser 308 
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Figure 19 

Mieroscllla rurveeeens B*t&raa* 53ac2 

ATO CTT ACA TtT AAT OTT TTA TAT GOT ATO ATO AAA CAA AAA CTA GCA OCA ATT CTC ATC : €0 

net Leu Thr Phe Asn Val Leu Tyr Oly Met Net lye Gin Lys Leu Ala Ala lie Leu Net 2o 

TTT TTA GOO CTA TCA GCA OCA GAG OCT CAA GAC TOO CCT QAC CTA CAG AAA TAT COT ACT 120 

Phe Leu Oly Leu Ser Ala Ala Glu Ala Gin Asp Trp Pro Asp Leu Gin Lys Tyr Arg Ser ' 40 

OCT AAT AAA GAA GCC AAA TTA CTT CCA AA0 GAA AAC C00 AAG GIG GTT TTT ATO GGC AAC 180 

Ala Asn Lye Glu Ala Lys Leu Leu Pro Lye Glu Aan Arg Lys Val Val Phe Met Oly Asn 60 

TCC ATT ACA GAA GCC TOO ATT ACT CAO CGA CCT GAO TTT TTT AOT GAA AAT GGG TTT ATC 240 

Ser lie Thr Glu Ala Trp He Ser Gin Arg Pro oiu Phe Phe Ser Glu Asn Gly Phe tie 80 

GOT C0A GGC ATC AOT GGC CAG ACA ACC CCT CAG ATG TTG TTO AOA TTC CGA CAG GAT GTG 300 

Gly Arg Gly He Ser Gly Gin Thr Thr Pro Gin net Leu Leu Arg Phe Arg Gin Asp Val 100 

ATA GAC CT0 CAG CCA AAG GCT OTA GTQ ATA CTA OCT GOT ACC AAT GAC GTA GCT CAA AAT 160 

He Asp Leu Gin Pro Lys Ala Val Val He Leu Ala Gly Thr Asn Asp val Ala Gin Aon 120 

ACC GGG CC0 ATG ACC ATT GAG GAA TCG CTT GCT AAC ATT AAG TCT ATG GTG GAG CTG GCO 430 

Thr Gly Pro Met Thr He Glu Glu Ser Leu Ala Asn lie Lys Ser Met Val Glu Leu Ala 140 

CAA GCC AAT GGG ATC ACQ CCT GTT TTG TOT ACC GTG CTG CCT GCA GAT COT TTC AGC TOO 480 

Gin Ala Asn Gly He Thr Pro val Leu Cys Thr Val Leu Pro Ala Asp Arg Phe Ser Trp 160 

CGA CCT GAG CTT ACA CCC GCA GAA ACT ATC ATT GCC CTC AAT CAG CTC ATT AAG CAA TAT $40 

Arg Pro Glu Leu Thr Pro Ala Glu Thr He He Ala Leu Aan Gin Leu He Lys Gin Tyr 180 

GCC GAG GCA CAO GGC CTG GCC CTO GTG OaT TAT CAT GCT GCA CTC ACC AAT AAA GGT GGA 600 

Ala Glu Ala Gin Gly Leu Ala Leu Val Asp Tyr His Ala Ala Leu Thr Ass Lys Gly Oly 200 

GGA CTT CCO GTG AAA TAC GGA GAA GAT GGT OTO CAT CCA AAT GTA GCA GGC TAT CAG GTG 660 

Gly Leu Pro val Lys Tyr Oly Olu Asp Gly Val Hie Pro Aan val Ala Oly Tyr Gin Val 220 

ATG GAA AAC ATT GTT TTA CCO GTC ATT TCC AGC GAG TTO GCA AAG CTG AAG TA 713 

Met Glu Asn He Val Leu Pro val lie Ser Ser Glu Leu Ala Lys Leu Lys 237 
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Plgrvro 19 

Thermo toga aaritiaa M988 Esterase 6tcl 

ATG GCC TTC TTC GAT TTA CCA CTC GAA GAA CTG AAG AAA TAT COT CCA GAG CGG TAC GAA 60 

Hot Ala Phe Phe Asp Leu Pro Leu Glu Glu Leu Lys Lys Tyr Arg Pro Glu Arg Tyr Glu • 20 

GAG AAA OAC TTC GAT GAG TTC TGG GAA GAG ACA CTC CCA GAG AGC GAA AAG TTC CCC TTA : 120 

Glu Lys Asp Phe Asp Glu Phe Trp Glu Glu Thr Leu Ala Glu Ser Glu Lys Phe Pro Leu , 40 

OAC CCC QTC TTC GAG AGG ATG GAG TCT CAC CTC AAA ACA GTC GAA COG TAC CAT GTC ACC ISO 

Asp Pro Val Phe Glu Arg Met Glu Ser His Leu Lys Thr Val Glu Ala Tyr Asp Val Thr ' 60 

TTC TCC GGA TAC AGG GGA CAG AGG ATC AAA GGG TGO CTC CTT GTT CCA AAA CTG GAA GAA 240 

Phe Ser Gly Tyr Arg Gly Gin Arg lie Lys Gly Trp Leu Leu val Pro Lys Leu Olu Glu 80 

GAA AAA CTT CCC TCC GTT GTG CAG TAC ATA GGA TAC AAC GGT GGA AGA GGA TTC CCT CAC 300 

Glu Lys Leu Pro Cys Val Val Gin Tyr lie Gly Tyr Asa Gly Gly Arg Gly Phe Pro His 100 

GAC TGO CTG TTC TGG CCT TCT ATO GGT TAC ATA TGT TTC GTC ATG GAT ACT CGA GGT CAG 360 

Asp Trp Leu Phe Trp Pro Ser Met Gly Tyr lie Cys Phe Val Met Asp Thr Arg Gly Gin 120 

GGA AGC GCC TGG CTG AAA GGA GAC ACA COO GAT TAC CCT GAG GGT CCC GTT GAC CCT CAG 420 

Gly Ser Gly Trp Leu Lys Gly Asp Thr pro Asp Tyr Pro Glu Gly Pro Val Asp Pro Gin 140 

TAT CCA GGA TTC ATG ACA AGA GGA ATA CTG GAT CCC AGA ACT TAC TAC TAC AGA CGA OTC. 480 

Tyr Pro Gly Phe Met Thr Arg Gly lie Leu Asp pro Arg Thr Tyr Tyr Tyr Arg Arg val 160 

TTC ACQ GAC GCT GTC AGA GCC GTT GAA GCT GCT GCT TCT TTT CCT CAG GTA GAT CAA GAA 540 

Phe Thr Asp Ala Val Arg Ala val Glu Ala Ala Ala Ser phe Pro Gin Val Asp Gin Glu 180 

AGA ATC GTG ATA GCT GGA GGC AGT CAG GGT GGC GGA ATA GCC CTT GCG GTG AGC GCT CTC 600 

Arg lie Val lie Ala Oly Gly Ser Gin Gly Gly Gly tie Ala Leu Ala Val Ser Ala Leu 200 

TCA AAG AAA GGA AAG GCT CTT CTG TGC GAT GTG COS TTT CTG TOT CAC TTC AGA AGA GGA 660 

Ser Lys Lys Ala Lys Ala Leu Leu Cys Asp Val Pro Phe Leu Cys His Phe Arg Arg Ala 220 

GTA CAG CTT GTG GAT ACQ CAT CCA TAC GCG GAG ATC ACG AAC TTT CTA AAG ACC CAC AGA 720 

Val Gin Leu Val Asp Thr His pro Tyr Ala Glu He Thr Asn Phe Leu Lys Thr His Arg 240 

GAC AAG GAA GAA ATC GTG TTC AGG ACT CTT TCC TAT TTC GAT GGA GTG AAC TTC GCA GCC 780 

Asp Lys Glu Glu He Val phe Arg Thr Leu ser Tyr Phe Asp Gly Val Asn Phe Ala Ala 260 

AGA GCG AAG ATC CCT GCG CTG TTT TCT GTG GGT CTC ATG GAC AAC ATT TGT CCT CCT TCA 840 

Arg Ala Lys He pro Ala Leu phe Ser Val Gly Leu Met Asp Asn He Cys Pro Pro Ser 280 

ACG GTT TTC GCT GCC TAC AAT TAC TAC GCT GGA COG AAG GAA ATC AGA ATC TAT COG TAC 900 

Thr Val Phe Ala Ala Tyr Asn Tyr Tyr Ala Gly Pro Lys Glu He Arg He Tyr Pro Tyr 300 

AAC AAC CAC GAG GGA GGA GGC TCT TTC CAA GCG GTT GAA CAG GTG AAA TTC TTG AAA AAA 960 

Asn Asn Bis Glu Gly Gly Gly Ser Phe Gin Ala Val Glu Gin Val Lys Phe Leu Lys Lys 320 

CTA TTT GAG AAA GGC TAA 978 

Leu Phe Glu Lys Gly 326 
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Figure 20 

Polyanglua brmchyepenm Bitirue 7fit&cl 

TTO AAG TAC TTC AAA GCC COO CTT GCC GGC ATC ACC TTO CTC COC CTO CIO CCC TGC ACC 60 

Leu Lyo Tyr Phe Lys Ala Arg Leu Ala Oly lie Thr Leu Leu Oly Leu Leu Ala Cys Thr 20 

TCG GCC TOO GCO CAQ ACC GAG CCC ATC GTO TTC 0T0 CAC GGC TAT TCC GGC AOC GCA TCC : 120 

Ser Ala. ser Ala Gin Thr Glu Pro Zle val- Phe Val His Oly Tyr Ser Gly ser Ala Ser • 40 

AAC TGC GAC ACC ATG CTO OOC CCC TTC COO TOO AAC 0OT TAT GCO TCC GCC TCC CTC TAC 180 

Asn Trp Asp Thr Mac Leu Gly Arg Phe Arg ser Asn Gly Tyr Ala Ser Gly Ser Leu Tyr , €0 

ACC TTC AAC TAC AAC TCO TTO GTC AGO ACC AAC CGC ACC AGC GCC AOC GAG CTO CGC AGC . 240 

Thr Phe Asn Tyr Asn Ser Leu Val Ser Ser Asn Arg Thr Ser Ala Ser Olu Leu Arg Ser SO 

TTC GTC AAC ACC GTO CGT TCO CGC CAC GGC AAC CCC OOC ATC GOG CTO GTC GCC CAC TCC 300 

Phe Val Asn Thr Val Arg Ser Arg His Gly Asn Ala Arg lie Ala Leu Val Ala His Ser 100 

AAC GGC GGG CTO GTO TOO COC TOO TAT COC GCO OAO CTO GGC GGC GAA ACG GCC ACC CGC 360 

Asn Oly Oly Leu Val Ser Arg Trp Tyr Arg Ala Olu Leu Gly Oly Olu Thr Ala Thr Arg 120 

CGC TTC GTO ACC CTO GGC ACG COO CAC CG0 OOC ACC ACC TOO CCC TAT GCO TGC TAC AGC 420 

Arg Phe Val Thr Leu Oly Thr Pro His Arg Oly Thr Thr Trp Ala Tyr Ala Cys Tyr Ser 140 

CCC OCX TOT TTC OAO ATG CCC CCC GOC TCC AGC TTO CTO ACC ACG CTO GGC TCG CGT GCC 400 

Pro Ala Cys Phe Glu Het Arg Pro Oly Ser Ser Leu Leu Thr Thr Leu Oly Ser Arg Ala 160 

TGC GAC COC TCO CTO TG0 TCO AAC ACC GAC GGC ATC ATC CTO CCQ GCO TCC AOC GCO CAO 540 

Cye Asp Arg Ser Leu Trp Ser Asn Thr Asp Oly lie tie Leu Pro Ala Ser Ser Ala Gin 180 

TOT OCT GTC AGC ACG CGC ACT GCC GAC GTC AGC CAT CTC GAC CTO CTO ACC GAC TCT CGC 600 

Cys Oly Val Ser Thr Arg Thr Ala Asp Val Ser His Leu Asp Leu Leu Thr Asp Ser Arg 200 

GTG TAC ACG CAQ TTO CGC ACG CAO TTO CAA TOA GGG TOA COO TGC ACC GAA COT GCA CCT 560 

Val Tyr Thr Gin Leu Arg Thr Gin Leu Gin Bad Oly End Arg Cys Thr Olu Arg Ala Pro 220 

a 661 
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